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ABSTRACT 

We compile a number of recent observations to estimate the time-averaged rate of formation or "buildup" of 
red sequence galaxies, as a function of mass and redshift. Comparing this with the mass functions of mergers 
and quasar hosts, and independently comparing their clustering properties as a function of redshift, we find that 
these populations trace the same mass distribution, with the same characteristic masses and evolution, in the 
redshift interval < z < 1.5. Knowing one of the quasar, merger, or elliptical mass/luminosity functions, it is 
possible to predict the others. 

Allowing for greater model dependence, we compare the rate of early-type "buildup" with the implied merger 
and quasar "triggering" rates as a function of mass and redshift, and find agreement. We show that over this 
redshift range, observed gas-rich merger fractions can account for the entire bright quasar luminosity function, 
and buildup of the red sequence at all but the highest masses at low redshift (> 1O H M0 at z < 0.3) where 
dissipationless "dry" mergers appear to dominate. This supports a necessary prediction of theories which 
postulate that mergers between gas-rich blue galaxies produce ellipticals with an associated phase of bright 
quasar activity, after which the remnant becomes red. All of these populations, regardless of sample selection, 
trace a similar characteristic "transition" mass reflecting the characteristic mass above which the elliptical 
population is mostly (> 50%) assembled at a given redshift, which increases with redshift over the observed 
range in a manner consistent with previous suggestions that "cosmic downsizing" may apply to red galaxy 
assembly as well as star formation. We show that these mass distributions as a function of redshift do not 
uniformly trace the all/red/blue galaxy population, ruling out models in which quasar activity is generically 
associated with either star formation or is long-lived in "old" systems. 

Subject headings: quasars: general — galaxies: active — galaxies: evolution — cosmology: theory 



1. INTRODUCTION 

Observatio ns motivate the n otion of "cosmic downsizing" 
(as coined by ICowie~ et al. 1996), with the global star forma- 
tion rate declining rapidly below z ~ 2, and the sites of galac- 
tic star formation shifting to smaller masses at lower redshift. 
Moreover, galaxy surveys such as SDSS, COMBO-17, and 
DEEP2 demons trate that the color distribution of galax ies is 
bimodal (e.g.. IStrateva et all 1200 it iBalogh et al] 120041). and 
that this bimodality ext ends at least to z ~ 1 (e.g. jBell etal.1 
l2004tlFaber eFalll^OOlh . 

It is increasingly established that high mass, red elliptical 
galaxi es have older stellar populations than smaller spheroids 
(e-g JCaldwell et all 120031: iNelan et al.ll200H iGallazzieFall 
2006). But, many studies also see a significant popula- 
tion of massive /luminous gala xies i n place ( i.e. assembled) 
by z ~ 2 (e.g., iPapovich et aill2006t iRenzind 120061 and ref- 
erences therein), with measurements of galaxy stellar mass 
functions (MFs) and luminosity functions at redshifts < 
z < 2 favoring either a uniform increase or buildup in the 
numbers of early-type ("red seque nce," RS) galaxies (e.g., 
iBell et all l2004t iFaber et all 120051) or an anti-hierarchical 
scenario in which this "buildup" at z < 1 occurs pr i marily 
at the low-mass e n d of the RS fe undy et all l2005aL I 2006L 
Zucca et all 12005b lYamada et al l 120051: iBorch et ail \20od 
Franceschini et al. 12006 pPannella et al.l 120061: ICimatti et al.l 
2006; Brown et al. 2006). The blue, disk dominated, star 



forming galaxy mass function (dominant at low mass), mean- 
while, remains relatively constant, or perhaps declines to 

1 Harvard-Smithsonian Center for Astrophysics, 60 Garden Street, Cam- 
bridge, MA 02138 

2 105-24 Caltech, 1201 E. California Blvd., Pasadena, CA 91125 



z = 0. As a consequence, the "transition mass," above which 
the red galaxy population dominates the galaxy MF, decreases 
with time, tracing this downsizing trend. There is evidence 
for some evolution at the highest masses as ellipticals grow 
by spheroid-sphe roid or "dry" mergers (Ivan Dokkum 2005; 
Bell et al. 2006a), but this, by definition, proceeds strictly hi- 
erarchically, and cannot account for the movement of mass 
onto the RS in the first place or any buildup in the number 
density of low-mass ellipticals. 

Meanwhile, the discovery of tight correlations between the 
masses of central supermassive black holes (BHs) in galaxie s 
and the bulge or sphe roid stellar mas s (Maaorrian et al.ll998[). 
veloc ity dispersion (Geb hardt et al.l2000l:lFer rarese & Merrittl 
2000) or concentration (Grah am et al.ll200lb implies that the 
formation of galaxies and BHs must be linked. More- 
over, the evolution of the quasar luminosity function (QLF) 
shows a sharp decline after z ~ 2, with the density 
of lower-luminosity AGN peakin g at low redshift (e.g., 
Hasinger, Miyaji, & Schmidt 2005, and references therein). 
To the extent that BH assembly traces galaxy assembly 
(i.e. there is weak evolution in the B H-host mass relation , 
as observed to at lea s t z > 1 by e.g. [ Shields etld] (120031) : 
lAdelberger & Steidell (l2005al) : iPenget all (120061) 1. this im- 
plies early assembly times (z > 1) for many of the most mas- 
sive systems containing Mbh > 10 8 M© BHs. 

A number of theoretical models have been proposed 
to explain the evolution of these populations with red- 
shift, and their corr el ations with one anothe r (e.g. , 
iKauffmann & Haehneltl |200Pi [gbmeryille et all 120011: 
Wvith e & Loebll2003t iGranato et alJl2004t ISc annaniec o et all 
20051: iBaugh et al.l l200l iMonaco & Fontanotl 12001 
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2006). In many of these models, the merger hypothesis 
dToomrd \l97% provides a potential physical mechanism 
linking galaxy star formation, morphology, and black hole 
evolution and explaining these various manifestations of 
cosmic "downsizing." In this scenario, gas-rich galaxy 
mergers channel large amounts o f gas to galaxy centers (e.g., 
iBarnes & Hernquist 1 199 111 19961). fue ling powerful starbursts 
(e.g., Mihos & Hernquistl 1994 1 19961) and buried B H growth 
(e.g. JSanders et alJl988tlBarnes & Hernquisl 19921) until the 
BH grows large enough that feedback fr om accretion rapidl y 
unbinds and heats the surrounding gas dSilk & Reeslll998h . 
leaving an elliptical galaxy satisfying observed correlations 
between BH and spheroid mass. Major mergers rapidly and 
efficiently exhaust the cold gas reservoirs of the progenitor 
systems, allowing the remnant to rapidly redden with a low 
specific star formation rate, with the process potentially 
accele rated by the expulsio n of remnant gas by the quasar 
(e.g., Springel et al. 2005a). This naturally explains the 
observed close associati on between the elliptical and red 
galaxy populations (e.g., Kauffmann et al. 20031). 

In a qualitative sense, the evolution of the characteristic 
mass at which these processes occur can be understood as 
follows. Mergers proceed efficiently at high redshift, occur- 
ring most rapidly in the regions of highest overdensity corre- 
sponding to the most massive galaxies, building up the high- 
mass elliptical MR However, once formed these galaxies are 
"dead", and mergers involving gas-rich galaxies must transi- 
tion to lower masses. 

Recent hydrodynamical simulations, incorporating star for- 
mation, supernova fee dback, and BH growth and feedback 
(Springel et al. 2005b) make it possible to study these pro- 
cesses self-consistently and have lent support to this general 
picture. Mergers with BH feedback yield remnants resem- 
blin g observed ellipticals i n their correlations with BH proper- 
ties (|Di Matteo et al.l2005l). scaling relations (Roberts on et alj 
2005b), colors (Springel et al. 2005a), and morphological and 
kinematic properties (iCox et a l. 2006a b). The quasar activity 
excited through such mergers can account for the QLF and 
a wide range of quasar prope rties at a number of frequencies 
(Hopkins et al. 2005a, 2006b), and with such a detailed model 
to "map" between merger, quasar, and remnant galaxy popu- 
lations it is possible to show that the buildup and statistics of 
the quasar and red galaxy pop ulations are consistent and can 
be used to predict one another (Hopkins et al. 20 06d) . 

However, it is by no means clear whether this is, in 
fact, the dominant mechanism in the buildup of early- 
type populations and quasars and their evolution with red- 
shift. For example, many semi-analytic models incorporate 
quasar trigger ing/feedback and morphologica l transformation 
by mergers ( Kauffmann & Haehnelt 2000; Volonteri et al. 
2003L 120061: IWvithe & Lo eb 2003: ISomerville et all l2004at 
Monaco & Fontanotl 120051: lBoweretal.1 l2006t lLapietalJ 
20061: iMenci et al]|2006l) . However, so me models tie quasa r 
activity directly to star formation (e.g., iGranato et al.ll2004l) . 
implying it will evolve in a manner tracing star-forming galax- 
ies, with this evolution and the corresponding downsizing 
effect roughly independent of mergers and morphological 
galaxy segregation at redshifts z < 2. Others invoke post- 
starburst AGN feedback to suppress star formation on long 
timescales and at relativ ely low accretion r ates through e.g. 
"radio-mode" feedback (ICroton et al.ll20 06). which, if this is 
also associated with optical QSO modes, would imply quasars 
should trace the established "old" red galaxy population at 
each redshift. There are, of course, other sources of feedback, 



with galactic superwinds from star formation presenting an 
alternative means to suppress subsequent star formation, al- 
though the required wind energeti cs are sufficiently h igh to 
prefer a quasar-driven origin (e.g., Benson et al. 2003). Sev- 
eral models invo ke a distinction between "hot" and "cold" 
accretion modes (|Bir nboim & Dekel 2003; Keres et aT]|2005l : 
iDekel & Birnboirrl 2006). in which new gas cannot cool into 
a galactic disk above a critical dark matter halo mass, po- 
tentially supplemented by AGN feedback (lBinneyll2004l) . as 
the dominant distinction between the blue cloud and red se- 
quence, essentially independent of effects on scales within 
galaxies. 

It is also important to distinguish the processes which may 
be associated with the initial movement of galaxies onto the 
red sequence from their subsequent evolution. Once mor- 
phologically transformed by a gas-rich merger, for example, 
mass can be moved "up" the RS (galaxies increased in mass) 
by gas-poor mergers, but it cannot be added to the red se- 
quence in this manner. It also remains an important cos- 
mological question to understand how, once formed, further 
growth of ellipticals by accretion or "cooling flows" may be 
halted. The models above invoke various feedback processes , 
including "radio mode" activity (e.g., ICroton et alJ [2 006). 
cyclic quasar or starburst-driven feedback ( Somervill e et al.1 
1 200 1[ IGranato et alJl2004lBinnevll200l;lMonaco & FontanoJ 
2005), massive entropy inje c tion from a single quasar epoch 
(e.g., IWvithe & Loebl 120031: IScannaoieco & Oh(l2004l) . and 
"hot mode" accretion dBirnboim & Dekel 12003) to address 
this problem. Although critical to our understanding of galaxy 
formation, these processes must operate over timescales of or- 
der the Hubble time for all massive galaxies once formed, and 
therefore are not necessarily associated with the addition of 
mass to the red sequence. As such, the details of these long- 
term suppression mechanisms should be studied in different 
(e.g. already formed elliptical) populations, and are outside 
the scope of this paper. 

Observationally, it is still unclear whether mergers can ac- 
count for the th e buildup of elliptical and/or quasar popula- 
tions (see, e.g., jFlovd et alJ 120041; iRothberg & Joseph! [2006; 
lLotzetalj r2006b. and references therein). Even within the 
context of the merger hypothesis, the relative importance of 
dissipational (gas rich, disk) vs. dissipationle ss (gas poor, 
spheroid- spheroid) me rgers is unclear (e.g., Ivan Dokkuml 
l2005tlBell et al.l2006al) . although all measurements agree that 
the "dry" merger rate is much less than the gas- r ich merger 
rate at all observed red shifts dBell et alJ l2006ah lLotz et all 
2006b: iBell et al.1 l2006bl) . This is essentially related to the 
critical question of whether the buildup of the red sequence 
and elliptical populations is dominated by the formation or 
movement of "new" early-type galaxies onto that sequence or 
instead by the hierarchical assembly of small "seed" early- 
types and substructure formed at high redshift (which will 
also not trigger quasar activity). 

Fundamentally, it is not clear and has not yet been tested 
whether the observed downsizing trends in the transition 
mass, galaxy stellar populations, quasars, and other popula- 
tions are in fact quantitatively the same trend, or merely qual- 
itatively similar. This represents a key test which can dis- 
tinguish between several of the various scenarios above. At- 
tempting to predict the values of this transition mass in an 
a priori cosmological manner is inherently model dependent 
and, at least at low redshift, degenerate between the various 
models described above. However, if mergers are indeed the 
critical link in the process causing the flow of galaxies from 
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the blue to red sequence and triggering quasar activity, then 
it is a strong prediction of these theories, and specifically the 
modeling of Hopkins et al. (2005a-d,2006a-d) that the same 
mergers are responsible for the bulk of the bright quasar pop- 
ulation and the buildup of the new mass on the red sequence 
at each redshift. In other words, these downsizing trends must 
quantitatively reflect one another. 

In this picture, the "transition mass" (M, r ) may represent 
the "smoking gun" of mergers causing the flow of galaxies 
from the blue to red sequence. Therefore, to the extent that 
M, r traces the mass at which the red sequence is being "built" 
at some z, it should also trace the characteristic mass of star- 
forming galaxies merging at that time, and the characteristic 
mass of galaxies hosting quasars which are initially triggered 
by those mergers. Of particular interest, the empirical test of 
this association does not require the adoption of some a priori 
model for galaxy formation. 

Here, we consider the observed M tr over the interval < 
z < 2, and compare it to the characteristic masses of quasar 
hosts and merging galaxies over the same range in redshift. 
We demonstrate that they appear to be evolving in a manner 
consistent with a merger-driven unification model of quasars, 
interacting galaxies, and the red galaxy population. Note 
that we use the term "quasar" somewhat loosely, as a proxy 
for high-Eddington ratio accretion inevitably caused by gas- 
rich mergers, although there may be other triggering mech a- 
nisms as w ell (Sanders et al. 1988; Alexand er et al.l 12005 alrbc 
iBorvs et al. 2005; Hopki ns et alj|2006bl) . Such activity will 
of course be significantly weaker in small systems (especially 
those typical of local ULIRGs) and may not technically qual- 
ify as a "classical" optical quasar (Hop kins et alj [2005b), but 
this distinction is ess entially arbitrary and h as little impact on 
our analysis (see also Hopkins et alJl2006dh . 

We adopt a fl M = 0.3, fi A = 0.7, H = 70k ms~ 1 Mpc~ 1 cos- 
mology. All stellar masses are rescaled to a Sal peterl d 19551) 
IMF. 

2. THE TRANSITION MASS AND BUILDUP OF EARLY-TYPE 
POPULATIONS 

2.1. Defining the "Transition" Mass 

Various studies have used different definitions and terms 
for the mass which separates the dominance of old, red, low- 
SFR elliptical galaxies from that of young, blue, star-forming 
disk galaxies. It is also possible to divide the galaxy pop- 
ulation along any one of those quantities. Although it has 
been established in a number of observational studies that 
the galaxy population is bimodal with respect to color, spe- 
cific star formation rate, and morphology (e.g..lStrateva et al 
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iKauffmann etalj200 3; Balogh et al. 2004; Driv er et al 



2005[). and that this bimodality ex tends at least to z ~ 1 (e.g., 
Bell et al.ll2004l: iFaber et~alll2005l) . it is still possible that the 
various definitions used to separate these bimodal distribu- 
tions could result in a systematically different "separation 
point." 

In what follows, we consider several definitions of the 
"transition" galaxy stellar mass in terms of the MFs ( <p(M)) 
of early and late type systems: the Bun dy et"aT] (l2005ah tran- 
sition mass M tr at which the density of early and late type 
systems are equal, 

Nearly Mtr , Z) = <fr ate Mtr,Z), ( 1 ) 

the iBundv et al.l d2006l) "quenching" mass Mq at which the 
contribution of late types to the total mass function cuts off, 

0iate(M, z) = MM, z) exp(-M/M G ), (2) 



and the Cimatti ^t alj d2006l) "downsizing" mass M50 above 
which 50% of the z = RS MF has been assembled by a given 
redshift, 

0early(M > M 50 , z) > 0.5 ear iy(M > M 50 , Z = 0). (3) 

For each, we consider a division between early and late 
types defined by either a color (i.e. separating galaxies on 
the redshi f t-dep endent RS from the "blue cloud"; see e.g. 
IBell et all (120041) '). star formation rate (generally from de- 
tailed population synthesis modeling, although we include 
looser emission/absorption galaxy spectral type separations), 
or morphology (either by visual morphological identifica- 
tion or concentration/surface brightness) criteria. Above 
these masses, early type galaxies in the red, low-SFR, high- 
concentration, morphologically elliptical "half" of the bi- 
modal distribution dominate the total galaxy MF, and below, 
late type galaxies in the blue, high-SFR, low-concentration, 
morphologically disk-like or irregular bimodal "half" domi- 
nate (generally true also for M50, though it is technically in- 
dependent of </>i ate ). 

We determine these masses from a number of compiled 
type-separated mass functions, shown in Figure Q] (for clar- 
ity, just the early-type MFs are shown). Data in all cases 
are converted t o our adopted cosmology and rescaled to a 
ISalpeteri d!955l) IMF. At all redshifts, only points above the 
quoted completeness limits of each study are shown. Errors 
are as publish ed, and generally accoun t for cosmic variance 
similar to e.g. ISomerville et al.l d2004bl) . A t z = 0, we gener- 
ally adopt the local MF determinations from lBell et al.l (2003) 
from 2MASS+SDSS observations, but find no change in 
our re sults considering e.g. the 2MASS+2dFGRS ICole et al] 
d2001l) d etermination. At hig her redshifts, o ur compilation 
includ es iPozzetti et all (120031 K20; z < 1.3) , iFontana et all 
d200l K20; z < 2.0 ) BundvetalJ (120051 G OODS; z < 
1.4) a nd IBundv et alj (|2006t DEEP2; z < 1.4) iBorch et alj 
d20M COMBO -17: z < 1.0).lFranceschini et all d2006l CDF- 
S; z < 1.4), and lPannella et alJ (120061 FORS Deep+GOODS- 
S; z < 1.15). Each of these considers the separate MFs of 
early and late types, divided according to at least one of the 
criteria above. Although many of these surveys cover a small 
area and thus may be subject to significant cosmic variance, 
our conclusions are unchanged if we restrict ourselves to the 
largest fields, and are in fact most robust for the wide-field 
surveys from COMBO-17 (0.78 deg 2 ) and DEEP2 (3.5 deg 2 ). 

Figure [2] shows resulting "transition" mass from these stud- 
ies as a function of redshift, from the different definitions 
and galaxy type segregation methods above. These sam- 
ples generally derive masses from optical+near-IR spectral 
and photometric fitting. To compare, we also consider the 
evolut ion in M50 (color-selected) determined by Cima tti et alj 
(2006). They compile the B-band early- type (red) galax y 
luminosit y functions from COMBO-17 dBell et al.l |2004|) . 
DEEP2 dWiUmer et alj 120061; IFaber et alj 120051) and the 
Subam/XMM-Newton Deep Survey d Yamada et all 120051) at 
z = 0- 1 . 15 (typical Az ~ 0.05 between luminosity functions), 
and use the redshift-dependent evolution of B-band mass- 
to-li ght ratios determined from fundamental plane studies 
(e.g.Jvan Dokkum & Stan ford 200 3} Ivan der Wei et al.1120051: 



Treu et alJ 120051: Idi Serego Alighieri et alJ 120051: iRenzinil 
2006, roughly similar to mean formation redshifts Zf ~ 3 — 4) 



to correct these to a z = equivalent B luminosity and mass 
(given the z = mass-dependent B/g-band M/L ratios from 
IBell et all d2003l) '). We fit a simple relation of the form 

Mtf oc (1 +z) K (4) 
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FIG. 1 . — Mass functions of early type galaxies selec ted by color, mor phology, or sp ecific star formation rate , from Bell et al. (2003, X 's), Borch et al. 1 2006, 
squares), Bundy et al. 12005a, 2006, filled circles), Pannella et al. 12006, open circles), Franceschini et al. (2006, stars), Fontana et al. (2004, inverted triangles), 
and Pozzetti et al. (2003, triangles). Points are colored by redshift, as labeled, and sh own only above the quoted completeness limits of each study. The data 
have been converted to our adopted cosmology and masses rescaled to a Saloeter (1955) IMF. Comparison suggests that "cosmic downsizing" may apply in some 
sense to early-type galaxy assembly, as well as star formation histories. 



in each panel (for illustrative purposes only, we do not intend 
for this to be considered a rigorous estimate of the evolution 
in these characteristic masses). 

Despite the different selection and type separation meth- 
ods and definitions of a characteristic mass, a nearly identical 
trend with redshift is recovered in every case. In fact, the best- 
fit slopes k for most selection methods and definitions are sta- 
tistically indistinguishable from the cumulative best-fit slope, 
and several of the definitions agree nearly exactly in Figure|2] 
It is also reassuring that the indirect estimates from optical lu- 
minosity functions, which generally involve the largest sam- 
ples and most finely probe the redshift evolution of M50, agree 
well at all redshifts with the MF estimates. This suggests that 
the trend with redshift is real, and that it is independent of the 
potential systematics in sample selection, as these systematics 
can be quite different for the various criteria shown. 

The absolute normalization of the "transition" mass does 
depend systematically on the definition chosen. For example, 
same separation by color or SFR gives a systematic factor ~ 2 
lower mass than separation by morphology, and Mq is sys- 
tematically higher than M, r b y about the same factor for all 
separation methods (see also Bundy et al. 2006). The system- 
atic difference in M, r and Mq can be understood as a conse- 
quence of their definitions (essentially one could define arbi- 
trary Schechter functions for early and late-type systems, and 
as long as the early-type function has a shallower slope and 
larger M», this small systematic offset in the two measure- 
ments would be guaranteed). The systematic offset between 
color/SFR and morphological selection is best demonstrated 
in the detailed compariso n of local color and morphologi- 
cally selected MFs in e.g. iBell et all (120031) . These authors 
find that although the different selection methods preserve the 
same qualitative behavior, and result in early and late-type 
samples which are identical in ~ 80-90% of the included 
galaxies, the color criterion does result in a slightly larger 
number of early-type systems (probably owing to the large 
scatter in blue galaxy colors, with a non-negligible highly 
dust-reddened population), which will push the characteris- 
tic separation mass slightly lower. This may also explain why 
there appears to be a larger scatter between samples at a given 
redshift in the morphologically-defined M, r and Mq, as such 
an effect will be sensitive to a given sample's resolution and 



imaging depth. There may also be an inte resting timescale 
effect, as discussed in Bund v~et~aT1 d2006t) . if galaxies red- 
den onto the red sequence somewhat more rapidly than they 
morphologically relax following mergers (perhaps suggesting 
different mechanisms for morphological and color transfor- 
mation). However, that Mso(z) (depending only on Nearly) is 
similar regardless of selection method suggests that these dif- 
ferences may be an artifact of the selection/identification of 
blue galaxies. In any case, further detailed study of these in- 
termediate objects and comparison between different samples 
is needed to understand these differences. 

These systematic distinctions caution against mixing defi- 
nitions in determining the redshift evolution of these masses. 
However, for our purposes, the systematic normalization scat- 
ter of a factor ~ 2 is not large - this is comparable to the in- 
herent ambiguity in defining a "characteristic mass" of any 
population (e.g. Schechter M*). As long as we are careful 
about the relatively small normalization offsets between se- 
lection criteria, we can safely compare the "transition" mass 
and its evolution with redshift to the masses of other popula- 
tions. 

2.2. The Buildup of Ellipticals and the Physical Significance 
of the Transition Mass 

It has been suggested (e.g. iBundy et al.1 l2005al) that the 
"transition" mass may represent the mass at which the early- 
type MF is "building up" at each redshift, in the sense that 
"new" spheroids are being added to the RS MF at this mass. If 
cosmic downsizing applies to galaxy assembly to any extent 
- i.e. this "building up" extends at higher masses at higher 
redshift (or "peaks" at lower redshift for lower masses), then 
this implies a shift of M50 to lower mass at lower redshift 
(ICimatti et al.ll2006l) . To the extent that the shape of the late- 
type MF does not dramatically change over th is redshift in- 
terval (observed in the samples above and e.g. iFeulner et al.l 
l2003t iDrory et al.ll2004l) this will, by definition, manifest in a 
similar evolution of M lr and Mq. In this scenario, then, M50 is 
directly tied to "downsizing" in the early-type MF. 

Figure [3] considers the evolution with redshift of the num- 
ber density of early-type galaxies of a given M* , from 9.75 < 
log(M*/M Q ) < 12.5. At low M„, the number density declines 
steeply with z (roughly oc (1 +z)~^ with (3 = 2.37). Although 



Co-evolution of Mergers, Quasars, and Red Galaxies 
Transition Mass M )r 



1 2.0 hbolor Selection 
11.5 - 



Quenching Mass M Q 

i 



Mass(50%) M 



50 




S 5 10.5 



0.0 0.5 1.0 1.5 0.0 0.5 1.0 1.5 0.0 0.5 1.0 1.5 



FIG. 2. — Evolution with redshift of the "transition" or "downsizing" mass, at which objects may be in transition from the blue cloud to the red sequence (i.e. the 
characteristic mass at which elliptical populations may be "building up") with different sample selections and definitions of this mass. Left: M tr , the mass above 
which ellipticals dominate the cumulative galaxy mass function, with galaxy types separated by color selection (upper), specific star formation rate or spectral 
(absorption/emission) fitting (middle), and morphological selection (lower). Center: Mq, the mass at which the contribution of late-type gala xies cuts off, with 
the same sample definitions as for M, r . Right: M50, the mass above which the early-type mass function at z is > 50% assembled relative to the Bell et al. 1 2003) 
mass function at z = (i. e. rf>(M > M50, z) > 0.5 <^>(M > M50, z = 0)). Dat a are shown from the mass functions in Fig ureQ] in the same p oint style, color - coded 
by the observed sa mple: IBell et alj ( 2003, black x's), Borch et al. 1 2006 , purple squares), Bun dv et al] {2005a, 2006, red fi lled ci rcles), IPannella et alj 120061 
green open circles), Franceschini et al. 12006, cyan stars), Fontana et al. ( 2004, blue inverted triangles), and Pozzetti et al. (2003, orange triangles). We also 
consider M50 calcula ted in Cimatti et al. (2006, black diamonds) from the luminosity functions of Bell et al. 12004); Willmer et al. 12006); Faber et al. 12005); 
lYamada et al] <2005T) , using the redshift-dependent mass-to-light ratios estimated from fundamental plane studies. Dotted lines in each panel show the best-fit 
trend of the form M* ex (1 +z) K ■ Dashed lines (identical in all panels) show a cumulative best-fit to the M50 data from all samples. Although there are systematic 
factor ~ 2 normalization offsets between different methods which caution against mixing definitions, the various methods all trace a similar mass. Regardless of 
the characteristic mass definition, the sample survey, or the method of type segregation of the samples, a similar trend with redshift is recovered in each case. 



the statistics are poor and variance large at high M», there is 
a significant trend (~ la) for a shallower decline in number 
density at higher masses {(3 ~ 0, i.e. little evolution in number 
density, at M» > 10 12 M Q ). Above z ~ 1, the various samples 
plotted begin to disagree, and the oc (1 + z)~^ functional form 
may not be a good approximation, so we reconsider this, fit- 
ting only the data at z < 1 -0 (z < 0.5), and find the same trend 
at ~ 6cr Aa). 

Figure |4] considers the differential growth of the early-type 
MF in more detail. We show the time-averaged buildup of 
early-type MFs in several redshift intervals from z = 0— 1.5. 
Where a given sample measures the elliptical mass function 
at two redshifts z and z + Az, we differentiate the observed 
elliptical mass function at every mass with respect to the two 
redshifts to obtain the time-averaged rate of creation of ellipti- 



cals of that mass, over that redshift interval. In total, we show 
four redshift intervals: z - 0.0 - 0.3, 0.3 - 0.7, 0.7 - 1 .0, and 
1.0- 1.5. Of course, we only compare mass functions mea- 
sured with the same technique and sample in the same study, 
since systematic offsets in methodology could severely bias 
such an estimate. Although the scatter is large (especially at 
low masses), the observations all trace a similar elliptical for- 
mation rate as a function of mass, with a similar break traced 
in the different samples at each redshift. The shape of this 
function is not the same as that of the early type mass func- 
tions - i.e. we are not simply recovering the fact that the mass 
function builds up uniformly over these intervals. 

At low redshift (z < 0.3), comparison with the early-type 
MFs in Figure[TJshows that sub-M* (M < 10 n M Q ) ellipticals 
are "building up" in number density by ~ 7— 15% per Gyr, 
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FIG. 3. — Number density of early-type galaxies in different mass bins as a function of redshift, relative to that at z. = from Bell et al. 1 2003, color-selected), 
from the samples in Figure [2] (same style). Dotted horizontal line shows 50% of the z = value, vertical horizontal lines show the redshifts at which the lower 
and upper limits of each mass bin correspond to the best-fit Msq(z) (dashed line in Figureff}. Dashed lines in each panel show the best-fit trend of the form 
oc (1 +z)~P, with the labeled /3. Although there is considerable variance at high masses, a mass-independent galaxy density evolution (3 can be ruled out at ~ la. 
A steeper (3 at low mass implies that a large fraction of these galaxies are added at lower redshift than galaxies of higher mass. 



whereas the most massive systems build up by only ~ 1% 
per Gyr. In other words, the most massive systems are not 
building up (either via star formation or assembly of stellar 
populations) at a significant rate at low redshift. At high z, 
there is a marginal shift of this function to higher masses. 
By z ~ 1, comparison with the corresponding early-type MFs 
implies that systems with M* > lO n M0 are building up by 
~ 20-50% per Gyr. The best-fit Schechter functions plotted 
in each redshift interval reflect this, with the Schechter M* 
shifting from 11.02 ±0.11 (z < 0.3) to 11.51 ±0.07 (1.0 < 
z<l-5). 

The low-z growth estimate of ~ 1% per Gyr in the most 
massive systems is in e xcellent agreemen t with that from 
iMasjedi et al. (2006) and lBell et aiTd2006bl) determined from 
local red galaxy (spheroid-spheroid or "dry") merger rates. 
In detail, observations suggest that the typical massive red 
galaxy und ergoes ~ 0.5 — 1 major dissipationle ss mergers 
since z = 1 (Ivan Dokkumll2005t iBell et al.l l2006a). If this is 



representative, it is trivial to predict the corresponding rate 
of buildup of the elliptical population, assuming every red 
galaxy undergoes this number of major (~ 1 : 1 mass ratio) 
mergers in this time. This is shown in Figure |4] calculated 
from the local MF of lBell et ail (120031) . Although these merg- 
ers appear to be important for building up the most massive 
galaxies at low (z < 0.3) redshift, their contribution cuts off 
completely below ~2x 10 n M Q (and will cut off at higher 
masses at higher z, generally about ~ 2 times the break in 
the "buildup" mass function). Such mergers move galaxies 
from the low-mass end of the elliptical mass function to the 
high mass end, and since the low-mass slope of the elliptical 
mass function is not steep, this can only decrease the number 
density of low-mass objects. By definition, then, dry merg- 
ers cannot account for the (substantial) buildup of total mass 
on the RS nor the buildup at low and intermediate masses. If 
these lower-mass ellipticals are formed by gas-rich mergers, 
then there must be at least ~ 2 times as many gas-rich mergers 
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FIG. 4. — Time-averaged rate of formation of elliptical galaxies, obtained by differentiating the observed elliptical mass functions in Figure [T] with respect to 
time (colored points; style as in Figureff). Results are shown over a number of redshift intervals, as labeled. Dotted lines show typical completeness limits at 
each redshift. Short-dashed, long-dashed, and dash-dotted vertical lines show M50, M tr , and Mq from Figure|2](color-selected fits) at each z- Solid line in each 
panel shows the best-fit Schechter function (to points above the completeness limits quoted for each sample), with the shaded range showing the Icr range of the 
best-fit M». High mass ellipticals appear to preferentially build up at higher redshifts. The characteristic masses in Figure[2]provides a reasonable proxy for the 
characteristic M» being added or "built up" in the early-type galaxy mass function at each redshift, as both evolve to higher masses at higher redshift. In the 
upper panel, thick dashed lines show the maximal contribution from spheroid-spheroid mergers if all undergo ~ 0.5 — 1 (lower and upper lines) such mergers in 
that redshift interval (at or below the break at each z, a significant dissipationless merging fraction will lower the number density at low M, where it is observed 
to rise). 



moving new galaxies onto the elliptical mass function as there 
are dry mergers (i n fact, the actual observed ratio is ~ 10 : 1, 
iBellet al.l d2006ah \ 

3. THE TRANSITION MASS AND CHARACTERISTIC GALAXY 
MASSES 

Figure [5] compares the "transition" mass determined above 
with the characteristic masses (Schechter M„) of the red, blue, 
and all galaxy populations. For clarity, we show just M50, 
as it is the most well-determined of the masses in Figure 12 
as well as being most robust with respect to sample defi- 
nitions/selection, and further it has the most direct physical 
interpretation (as it is not degenerate with blue cloud evo- 
lution). Our conclusions here and subsequently, however, 
are unchanged regardless of the mass definition from Fig- 
ure |2] The characteristic masses of red, blue, and all galax- 
ies are nearly constant with redshift, with at most a marginal 
(~ 0.2 dex) increase from z = 0- 1 ; inconsistent with their fol- 
lowing the strong trend seen in M50 at > 10cr (independent 
of normalization). Likewise, comparing the shape of the rate 
of elliptical buildup in Figure [4] with these galaxy mass func- 
tions at the same redshift shows that they do not trace the same 
mass distribution as a function of redshift. We can therefore 
(perhaps unsurprisingly, given the definitions employed) rule 
out at high significance the hypothesis that "transition" mass 
objects are uniformly/randomly drawn from a "parent" popu- 
lation of normal galaxies of either early or late (or both) types. 

It may appear that the strong trend in M50 (i.e. the mass 
above which the RS MF is > 50% assembled at z) is incom- 
patible with the weak trend in of red galaxies (as e.g. no 
change in M* would imply uniform buildup of RS populations 
at all masses). This, however, is an artifact of the Schechter 
function fit. For example, given a local early-type Schechter 



function MF with = (j>o, a, and = Mq and a similar MF 
at z with <fi* = <f> v the same a (commonly assumed in fitting), 
and = Mq (1+5), one obtains 

1 \-i 

M 50 (z) = M ( ' 



1 + 5 



In 



26 7 



(i+sy 



(5) 



which is quite sen sitive to 5 and, for the observed values of 
0*(z), M *(z) (e.g.. iBorchetal .1120061) predicts a ~ l-2dex 
evolution in M50 similar to what we find. 

In a more physical sense, the local RS MF is the sum of the 
RS MFs built up over various intervals, each of which resem- 
bles a Schechter function (see Figure @). Consider the sum 
of two Schechter functions with identical a and </>„, but one 
having lower by 1 dex. Fitting this to a Schechter function 
over the range M» ± 1 dex (± 1 .5 dex) yields a best-fit with M* 
only lowered from the higher value by ~ 0.1 dex (0.05 dex). 
This owes to the steep fall in <fr{M) atM> M*, which forces 
the combined fit to retain the high M* value. In further detail, 
if we imagine the rate of buildup d<fi(M)/dt(z) (Figure |4|i is 
a Schechter function with constant a and normalization, but 
an evolving M* = M 5 q{z) (adopting the best-fit trend M^q(z) 
shown in Figure |5), then fit the integrated </> ea riy(Af, z) to a 
Schechter function (fitting over the range M» ± 1 dex), we 
obtain only ~ 0.15-0.20dex evolution in the early-type M* 
from z = to z = 1 , despite the more than order-of-magnitude 
evolution in M5o(z). Thus, although the strong evolution in the 
"transition" mass with redshift rules out its being representa- 
tive of the general elliptical population, it is not inconsistent 
with the weak evolution in the early-typeM*, even if M50 does 
represent the characteristic mass at which "new" galaxies are 
being added to the red sequence. In other words, weak evolu- 
tion in M* of red galaxies does not rule out strong evolution in 
the characteristic masses being "built up" on or added to the 
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FIG. 5. — Comparison of the "downsizing" or "transition mass" with characteristic galaxy masses. Upper left shows M50 (points; from color-selected samples) 
and best-fit trend (dotted line) from FigurefJ] Other panels show the best-fit Schechter function M„ to the all-galaxy mass functions, and color-selected red and 
blue galaxy mass functions (points as in Figureff). The trend in M50 (and M, r , Mq) does not trace the all, blue, or red galaxy populations - i.e. "transition mass" 
systems are not uniformly drawn from any of these populations. Note that the trend in M50 is not incompatible with that in M* of red galaxies, as the Schechter 
function parameterization is such that an order-of-magnitude change in M50 at a constant rate of elliptical "formation" implies only a ~ 0.1-0.2 dex change in 
M* (which can be further offset by a small dry merger rate). 
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4. THE TRANSITION MASS AND MERGERS 



We next consider observed merger MFs. We compile the 
local (z < 0.2) pair-selecte d major (within ~ 1 mag) merger 
luminosity functions from IXu et all (12004 2MASS) in K- 
band and Tole do et al] ([1999) in B-band, as well as the mor- 
phologically identifi ed merger/intera c ting g alaxy luminos- 
ity functions from iBrinchmann et all (1998, CFRS+LDSS; 
z < 1) in fi-band and iWolfetalJ (12001 GEMS+GOODS; 
Z ~ 0.7) in the near-UV (280 n m), and mass functions 
from IConselice et alJ (12001 12001 HDF-N and HDF-S; z ~ 
1-3) and Bundy et al. (2006, GOODS+DEEP2; z < 1.4). 
Where the MFs are not directly measured, we rescale the 
luminosity functions to mass functions using the mass-to- 
light ratios of ongoin g mergers (as a function of M*) from 
Hopki nset alJ d2006dl see Table 1). T hese are calculated 
from t he population synthesis models of Bruzu al & CharloJ 
J20031 given the distribution of star formation histories 
during mergers determined from several hundred numeri- 
cal simulations that include star formation, supernova feed- 
back and metal enrichment, and black hole accretion and 
feedback (see Hopkins et al. 2006d for details; this is es- 
sentially a second-order improvement on the typical empir- 
ically adopted simplified tau+burst models for these M/L). 
These should be reasonably robust: they have also been 
checked directly in the bands of interest he re against the 
measu rements of M/ L ratios in local UL IRGs dTacconi et alJ 
l2002h . pair samples ( iDasvra et alj 12006). and recent merger 



remnants (Rom berg & Jo seph 2004), and give good agree- 
ment dHopkins et all l2006di Fi gures 1 & 4). Furthermore, 
HopkinsetaLl d2006d, Figures 8 & 9) demonstrate that they 
can be reliably used to convert merger luminosity functions to 
mass functions (in exactly this manner) for all samples above 
where both are measured. In any case, our subsequent results 
are unchanged (albeit their significance reduced given the lim- 
ited data) if we consider only the morpholog ically identified, 
directly mea sure d merger ma s s fun ctions of IConselice et alj 
d2003l 120051) and lBundv etal] (120061) . 

Figure|6]shows several (the most well-constrained) of these 
MFs as a function of redshift, with the best-fit Schechter func- 
tions. Since constraints are weak at the faint end (and sys- 
tematic uncertainties large; discussed below), we consider fits 
with fixed faint-end slopes a, as well as allowing a to vary. In 
all cases, there is a qualitatively similar trend for the Schechter 
function to increase (by ~ 0.5 -0.7 dex from z = 0— 1). 
Direct comparison of the MFs demonstrates that this does not 
necessarily mean the number of mergers at the high-mass end 
increases monotonically with redshift (changes in 0* can off- 
set the increase in M*); this does however imply that the rela- 
tive merger history /MFs favors higher mass mergers at higher 
redshifts. Although it is not important for our comparison, 
there is also a marginal trend for (f>* of mergers to increase 
with redshift relative to 0* of the entire galaxy pop ulation, 
but thi s trend is largely driven by the low from lXu et alj 
(120041) . 

Figure [7] plots these Schechter values, as well as those 
determined from the other merger mass/luminosity functions 
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- Comparison of observed merger mass func tions from 
= 0.5-0.8 (blue), and z = 0.8 -1.2 (red), and lWolfetal 



Xu et all J20041 black stars) at z < 0.2, IBundv et all <2005a, 2006, circles) at z = 0.2 - 0.5 
42005L green diamonds) at z - 



Fig. 6. 

(purple), z = 0.5 - 0.8 (blue), and z = 0.8 - 1 .2 (red), and lWolfetal] j2005l green diamonds) at z = 0.7 - 1.0 (upper & lower z points from GEMS and GOODS, 
respectively). Panels show the merger mass functions (right), with best-fit Schechter functions (lines of corresponding color), and best-fit Schechter function <p t 
(left) and M» (center). Because constraints on the faint-end slope are weak, upper panels fix its value a = -0.7, middle p anels fix a = —1. 0, and lower panels 
allow a to be fit Open points in the le ft panels show rf>t of the entire galaxy popul ation as compi l ed in Faber et al. 12005) from Bell et al. (2003, SDSS; star), 
IMadewick et all 120031 2dF; diamond). [Bell et alJj2004l COMBO-17; circles), and lWillmer et alj 120061 DEEP2; squares). Dotted lines in center panels show 
the fitted M^oiz) (lower) and M tr (z) from Figure [2] (color- selected). Regardless of the choice of a, there is a trend for the characteristic merger mass M* to 
increase with redshift in a manner similar to M tr . Interpretation of <f>* is more ambiguous, but there is a suggestion that it increases relative to the </>„ of the 
galaxy population. 



we compile, as a function of redshift, compared to the char- 
acteristic "transition" mass (M50). The characteristic masses 
from pair and morphologically-selected samples, as well as 
direct MFs, optical, near-IR, and near-UV luminosity func- 
tions agree surprisingly well at overlapping redshifts, at least 
up to the z > 2 estimates which are strongly affected by cos- 
mic variance dConselice et alj 120051) . which further suggests 
the typical merger M/L ratios used are reasonable. There is a 
significant (> 3a) trend for the characteristic masses of merg- 
ers to increase with redshift. As with the "transition" mass 
population in Figure [5] this trend rules out at > 3 a the hy- 
pothesis that mergers are randomly/uniformly drawn from the 
all or red galaxy population. Whether mergers uniformly trace 
the blue galaxy population is less clear; the values plotted are 
inconsistent with this hypothesis at ~ 3.5a, but the trend alone 
(i.e. allowing for a systematic normalization offset) is incon- 
sistent at only ~ 2a. The values/trend of the merger MF M* 
as a function of redshift are, however, similar and statistically 
consistent with M50 (and even more similar to M tr and Mq, 
see Figure \15[. 

We can consider in greater detail if observed merger 
mass/luminosity functions are consistent with the observed 
buildup of early-type populations by examining the complete 
mass functions as a function of redshift. Figure [8] reproduces 
Figure @] but overlays the observed merger MFs from Fig- 



ure [6] at the appropriate observed redshifts. Since there is 
considerable ambiguity in converting an observed merger MF 
to a merger rate, we renormalize (vertically only; i.e. divide 
out an appropriate timescale) the observed MFs arbitrarily 
such that we can focus here just on the unambiguous mass 
distribution (although we will consider the issue of absolute 
rates/normalization shortly). The agreement is striking: inde- 
pendent of the systematics in understanding merger rates, the 
observed distribution of mergers as a function of mass and 
redshift traces and is consistent with the buildup/addition of 
galaxies to the red sequence at all masses and redshifts ob- 
served. 

Although systematically uncertain, we should also com- 
pare the implied merger rates (i.e. vertical normalization in 
Figure [8}. In other words, even if the buildup of early-type 
populations traces the merger mass distribution, are there the 
appropriate total number/rate of mergers to account for the 
growth of the RS MF (assuming mergers are the agent of this 
"buildup")? To estimate this, we assume every elliptical for- 
mation/addition event in Figure |4] indeed owes to a merger, 
which is observable as such for some amount of time t meiser 
(until morphological disturbances such as tidal tails fade be- 
yond typical surface brightness limits). This yields the ex- 
pected merger MF, (f>(M merga ) ~ fmergePC^gai)- Given the 
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FIG. 7. — Left: "Transition" mass M50 from all sample selections shown in Figure [2] (black points in same style), and the b est-fit trend with redshif t (black 
dotted line), compared to the ch aracteristic mass M , from mer ger mass and lumin osity functions (red points; see Figure|6j, from Xu et al. 1 2004, stars), Wolf et al. 
)200a filled diamonds).IBundv et alj f2005a. 2006j, circles), IConselice et al.H2003l HDF-S. crosses; HDF-N, open diamonds), Toledo et al. ( 1999, triangles), and 
Brinchmann et al. ( 1998, squares), with the best-fit trend of the form M» oc (1 +z) K (red dotted line). The M* values shown allow the merger mass function 
faint-end slope a to vary freely, but a similar result is obtained fixing a to match the early-type or all-galaxy values (see Figure|6). Right: Same, but for clarity, 
only M50 from color-selected samples and the best-constrained 1 /2 of merger M* values are shown. Despite the small sample sizes, the characteristic mass of 
merger mass functions increases with redshift at > 3<r (implying mergers are not simply drawn from the approximately constant M* all, blue, or red galaxy 
populations) and is consistent with the value and evolution of M50 as a function of redshift, as in Figure[8] 




log(M ;t r4/M ) 

FIG. 8. — Time-averaged rate of buildup of early-type populations, as in Figure|4] but with th e observed merger mass functions from Figure |6]overlaid (black 
points, style as in Figure[6}. Merger mass function points at z ~ 0.7-1.0 from Wolf et al. (2005) show the effects of systematic differences in imaging depth and 
survey area, from GEMS (squares), GOODS (x's), and GEMS+GOODS (diamonds). Since the merger timescale is observationally undetermined, we compare 
the mass distribution in mergers and the early-type buildup (i.e. merger mass function are renormalized for direct comparison). The time-averaged buildup of 
elliptical populations traces a similar mass distribution to that of observed merger populations at all redshifts observed, and not the same mass distribution as that 
of the all, red, or blue galaxy population as a function of redshift (compare Figure]5J. 
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galaxy MFs from which we calculated these rates in the first 
place, it is then trivial to estimate the expected merger fraction 
as a function of mass. 

Figure [9] compares this estimate, adopting a charac- 
teristic f mer ger = 0.5 Gyr, with observed merger fractions 
as a function of mass and redshift. This timescale is 
rough ly expected from num erical simulations of mergers 
(e.g., [Robertso n et al.ll2005al) . dynamical friction considera- 
tions dPatton et al.ll2002l ). or more detailed estimates of obser- 
vation al selection effects a s a function of merger stage (see, 
e.g., [Hopkins et al. 2006d, for a detailed discussion of these 
issues). We consider the merger fraction above two represen- 
tative stellar mass limits, AT, > 1O 1O M andM* > 10 n M Q , 
but caution that these are not, for the most part, mass-selected 
samples, so the mass limits shown in Figure[9]are only broadly 
applicable. We calculate the merger fraction as a function 
of mass directly from the merger mass functio ns of IXu et ail 
( 2004): IWolfetaT] d2005h : lBundv et all d2005ah shown in Fig- 
ure[6l at redshifts where the samples are complet e to the given 
mass li mit (this essentially excludes the z > 1 iBundy et alJ 
(2005a) merger mass function). 

The observed merger fractions are consistent with this es- 
timate at all redshifts. The buildup of elliptical populations 
does suggest that merger fractions should increase as a func- 
tion of redshift, but we note that the effect is quite weak. 
If, for example, the characteristic merger timescale decreases 
with redshift in the same manner as halo dynamical times 
(at fixed mass), oc (1 + z)~ 3 ^ 2 (decreasing the expected merger 
fraction we calculate by this amount), the expected increase 
in merger fraction with redshift becomes marginal (factor 
< 2 by z ~ 1.5). Both cases, however, are consistent with 
the present observations (at these redshifts, within the factor 
~ 2 typical uncertainties). There is also marginal evidence 
for steeper evolution in merger fractions with redshift in the 
higher -mass cut we consider, evide nce for which is also seen 
in e.g. lConselice et all d2003[ [2005). but we caution both that 
this trend appears only when different samples are combined, 
and that it will be "washed out" by the increasing importance 
of dry mergers at high masses and low redshifts (see also 
Figure [13}. Future studies which can separate gas-rich and 
gas-poor merger populations and track the merger fraction as 
a function of redshift and mass can provide a substantially 
stronger test of these trends. 

Caution regarding systemati c uncertainties in m erger pop- 
ulations is still warranted. iLe Fevre et alJ (l2000h find that 
pai r and morpholog ical selection criteria yield similar results, 
but lLin et all (12004) see significant disagreement, which may 
be the result of systematic ident ification of "false" (i.e. non- 
merging) pairs at low redshift dBerrier et al.l l2006h . Fortu- 
nately, the characteristic merger mass (or luminosity) does 
not appear to change dramatically with selection method even 
though the time spent in a given phase (a nd thus 6 * or me rger 
fraction) may. For ex ample, the data of iLin et alJ d2004l) and 
IConselice et alJ d2003l) do yield a similar characteristic merg- 
ing galaxy luminosity L*, despite finding different merger 
fractions. One might also wonder whether the natural ten- 
dency of a flux-limited sample to select brighter systems at 
higher redshift might lead one to infer an increasing mass 
scale regardless of the underlying mass distribution. However, 
Figure[6]demonstrates that the completeness limits for most of 
the samples we consider are generally well below the "transi- 
tion" mass, and similarly below the b reak in the c orre spond- 
ing mass function. Further, although IWolf et all d2005l) find 



that the observed number of faint mergers depends on selec- 
tion effects (see Figure[HJ which shows the increase in number 
of low-mass mergers when increasing imaging depth and de- 
creasing field size by an order of magnitude, from GEMS to 
GOODS), this does not significantly effect the merger M» or 
change our comparison in Figure [8] Still, these effects must 
be accounted for in any comparison of fitted MFs. 

It is furthermore true that the exact appropriate value of the 
duration of observable merger activity (f me rger) is not well- 
determined, and will in detail depend on the sample, mass 
limit, and redshift, but for our purposes these effects (amount- 
ing to a systematic factor ~ 2 uncertainty) are generally com- 
parable to or smaller than the scatter in the observations in 
Figure [9] Preliminary estimates of the observable merger 
timescale based on compa rison with autom ated nonparamet- 
ric classification schemes dLotz et al.ll2004h suggest, perhaps 
surprisingly, relatively weak trends with redshift (at least at 
Z < 2; see the discussion in Hopkins et al. 2006d), but lack- 
ing a complete cosmological framework from which to pre- 
dict observable merger properties, Figure [9] should be taken 
with the strong caveat that the relative normalizations of 
galaxy buildup and observed mergers depends systematically 

On ^merger- 

5. THE TRANSITION MASS AND E+A GALAXIES 

Detailed studi es of "E+A" (or "K+A") galaxies 
dDressler & Gunnl 119831) . with characteristic post- 
starburst stellar populations indicating a substantial 
but rapidly quen ched star formation epoch in the last 
~ 0.1 - 0.5 Gvr dCaldwell et alJ fl996t ICouch & Sharpies! 
119871: lOuintero et al.ll2004l) . have found ubiquito us evidence 
of morphological disturbances and tidal tails dSchweizerl 
Il996t iBlake et all l2004t iGotol 120051). which together with 
their environmental (e.g., Zabludo ff et al J 1 19961: lGoto l |2005l) 
and structural/kinematic properties dKelson et al.l 120001 : 
iNorton et al.ll200ll: iTran et alJl200H Ivan der Wei etMWMi 
imply their formation in mergers and evolution into typical 
early-type galaxies. We therefore consider whether E+A 
galaxies, presumably recently formed in mergers, trace any 
"downsizing" trend. 

Figure [10] compares Mso and the masses of observed E+A 
galaxies as a function of redshift. At low redshift, sizable 
samples exist, and we show the characteristic Schechter func- 
tion M* of E+A populations. At higher redshift, samples are 
extremely limited, and we can only plot the masses of individ- 
ual syst ems. The points a s plotted appear to "downsize," as 
noted in lTran et alJ ([2003 ), but this trend could well be com- 
pletely driven by survey flux limits. Lacking volume-limited 
samples or complete E+A MFs at high redshift, we can only 
presently say that the E+A data are not inconsistent with the 
downsizing in the "transition" mass or any of the other hy- 
potheses considered herein. 

6. THE TRANSITION MASS AND QUASARS 

6.1. The QLF Break Expected from the Transition Mass 

If the formation/movement of galaxies on the RS is asso- 
ciated with a quasar "trigger" (for example, through quasar 
feedback being an agent of reddening, or both being asso- 
ciated with a merger), then the observed quasar luminosity 
function (QLF) should reflect the rate of elliptical forma- 
tion/buildup shown in Figure [4] - indeed, in such a model, 
each quasar "broadcasts" a galaxy moving to/forming on the 
red sequence. (If the quasar "lifetime" were of order the 
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FIG. 9. — Expected merger fraction as a function of redshift, if the buildup of the early-type mass functions in Figure [6]is entirely a result of mergers 
movin g galaxies onto the red se quence (colored point s as in Figure[6). B lack points sh ow observed mer ger fractions as a function of reds hift, fromlPatton et alj 
1 2002, filled inverted triangles), Conselice et al. (2003, filled circles), Bundy et al. 1 2004, filled triangles), Lin et al. 1 2004, open diamonds), Xu et al. (2004, open 
stars), De Propris et al. (2005, open circles), Cassata et al. 1 2005, filled diamonds). Wolf et al. (2005, filled stars), Bundv et al. (2005a, open triangles), Lotz et al. 
1 2006a, open inverted triangles), ILotz e t al. (2006bj, open squares), and [Bell et al. 1 2006b, filled squares), Results are shown for two (approximate) minimum 
stellar mas s limits, as labeled. Solid line shows the expected gas-rich merger fraction if all bright quasars are triggered in mergers, using the same modeling from 
Figure [T2l to det ermine the quasar- parent mass function from the observed quasar luminosity function. Dashed line adds a constant fraction (observed 0.015; 
Bell et al. 2006a; Lotz et al. 2006b) of dissipationless (spheroid-spheroid) mergers. An observable merger timescale of 0.5 Gyr is assumed. The expected merger 
fractions from the observed buildup of early-type mass functions and the quasar luminosity function agree reasonably in their normalization and evolution with 
observed merger fractions. There are sufficient mergers to account for both populations, and little room for a large fraction of mergers which do not produce a 
remnant elliptical or trigger quasar activity. Dissipationless mergers are generally a relatively small effect as is observed, but may be important for the buildup of 
the most massive systems at low (z < 0.5) redshifts. 

Hubble time, of course, then the QLF would reflect the in- 
tegrated/established early-type population, but observations 
constrain it to be much less at all redshifts, e.g. Martini 2004; 
Hopkins, Narayan, & Hernquist 2006; such that the QLF 
tracks the rate of "triggering.") 

It is straightforward to compare these. A spheroid of 
mass M„ hosts a black hole of mass M Bii = /xM» ss 
0.001 : iMagorrian et all [l998t iMarconi & Hunll l2003h . con- 
fir med by direct observations at all redshifts of interest (z 5, 
2:IShields et al.ll2003l:lPeng et "aT]l2006l lAdelbereer & Steidel 
l2005al) . A "quasar" event is essentially defined by "ignition" 
of the black hole for a brief ti me (tg < 10 7 yr from various 
observations; see lMartinill2004 and references therein) near 
the Eddington limi t L = 3.3A x 10 4 L^ (M BH / M W ) (where A = 
L/L E d d ~ 1; e.g., iMcLure & Dunlopl 12004: iKollmeier efafl 
120051) . Thus, the formation or movement to the RS of a 
spheroid of mass M* would be associated, in this scenario, 
with a short-lived quasar of luminosity 
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FIG. 10. — Observed mass of E+A galaxies (colored points, as labeled) 
compared to the transition mass M50 (black points, as in Figure [7] from 
color-selection) as a function of redshift. We show the fi t ted M * of the E+A 
mass f unction where available (points from Norton et al. (2001); Bla ke et all 
(2004) with error bars, as labeled), but owing to limited sampl es otherwise 
show the masses of individual E+A galaxies observed in clusters (Fr anxl 19931 : 
ICaldwell et alJ1996j] van Dokkum & Stanford 2003, both stellar and dynam- 
ical masses from ITran et al. 12003) are shown, as labeled) and in the field 
(Tran et alj|2004T) . The masses of E+A galaxies appear to trace the charac- 
teristic masses of mergers and the transition mass as a function of redshift, 
but lacking larger samples at z > from which to determine a full E+A mass 
function, it cannot be determined whether or not this is merely a selection 
effect. 
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This simple effective conversion for bright quasars is sup- 
ported by both numerical simu lations of quasars and galaxy 
mergers (Hopkins et al. 2006tj) a nd direct comparison of 
quasar and host galaxy luminosities dVanden Berk et al.l2006l; 
iPeng et aDl2006t iRichards et a l. 2006b). There is, of course, 
some uncertainty and observed scatter in the host galaxy-BH 
mass correlation and bright quasar Eddington ratios, but it is 
constrained to a factor ~ 2, comparable to the uncertainty in 
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the observed M tr . 

If this is the dominant mode of quasar triggering, then al- 
though the exact normalization of the QLF (number of ob- 
served quasars) will depend on the "duty cycle" 6 and quasar 
lifetime fg, the break M* in the host/source mass function 
(break in the rate of "formation" of early-type galaxies cal- 
culated in Figure|4|i will translate directly to a break L* (from 
Equation [6} in the QLF. If S and/or tg are complicated func- 
tions of mass, luminosity, or redshift, they might change the 
slopes of the resulting QLF, but will not move the break lo- 
cation L*. Thus, L* directly tracks the characteristic mass of 
the host population. Put another way: essentially all observed 
L > L* quasars have A ~ 1, i.e. define a characteristic active 
black hole mass Mbh oc L», and since M* ~ M^h/ A 1 at a H red- 
shifts of interest, the characteristic host mass M» of quasars is 
well-defined at each z. 

Figure QT| compares the QLF characteristic luminosity ex- 
pected (Equation [6]l from objects of the "transition" mass 
(M50) and the observed characteristic QLF "break" luminos- 
ity L* as a function of redshift. The break L» is deter- 
mined in the standard fashion, fitting the observed QLF at 
each redshift to a double power-law. We show L* measured 
from optical, soft X-ray, and hard X-ray studies, each con- 
verted (to enable direct comparison) to a bolometric lumi- 
nosity L using a standard observationally-derived bolomet- 
ric c orrection (template quasar SEP) and re ddening correc- 
tion JHopk ins. Richards. & Hernquistl 120071 and references 
therein; note that adopting the less recent bolometric correc- 
tions from Elvis et al. 1994 or Marconi et al. 2004 yields 
ne arly identical results). We also show th e break determined 
bv lHopkins. Richards. & Hernquistl d2007l see their Table 2), 
who compile a large number of QLF measurements through 
the mid and near-IR, optical, near-UV, soft and hard X-ray, 
and soft gamma ray and use these to directly determine the 
bolometric QLF. In any case, the observed L* is robust; in 
fact, the (typical factor ~ 2) discrepancies in L* owe mostly 
to the data binning and fitting function, and a direc t com- 
parison of the data in Hopkins, Richard s". & Hernquistl (|2007|) 
shows they trace a similar turnover/break. We also note that 
the existence of a break is unambiguous (detected at ^ 10<7 
in most of the samples in Figure [TTJ, regardless of whether 
it is sharp (as expected for a double power-law fit) or ex- 
hibits som e higher-order curvature (as for a Sch echter func- 
tion; e.g., IWolfet alJlIOOl IRichards et aLll2005h . It is unaf- 
fected by questions of completeness, as the X-ray surveys are 
typically complete to ~ 2 orders of magnitude in lum inos- 
ity below L» (e.g., Hasinger, Miyaji, & Schmidt 20051). The 
break luminosity also increases with redshift (at least to z ~ 
2), as has long been recognized in quas a r surveys (recently , 
e.g. iBovle eUpHOQOt lUeda et all 120031 ICroom et all 120041: 
IRichards et al.ll2005l: iHasinger. Mivaii. & Schmidtll200l sig- 
nificant in each case at ^ 6a), regardless of higher-order 
subtleties im plied by "luminosity-dependent d ensity evo- 
lution" (e.g., Hasinger, Mivaii, & Schmidt 2005, and refer- 
ences therein) models and other chang es in the detailed QLF 
shape as a function of redshift (e.g., IRichards et al.ll2006al: 
Hopki ns. Richards. & Hernquist 20071). 

That the expect ed L* from M50 agrees w ith the observations 
is not surprising. Hopki nset al.l d2006d) combined observed 
merger mass functions with a larg e suite of corresponding hy- 
drodynamical merger simulations (Robertson et al. 2005b) in- 
cluding black hole accretion and feedback, to calculate what 
the resulting merger-driven QLF should be in each band at 
each corresponding redshift. Their predicted merger-driven 
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FIG. 11. — Predicted location of the quasar luminosity function (QLF) 
break L* as a function of redshift, from the observed transition mass (black 
points show A/50 as m Figure [2] from all samples; dotted line shows best-fit 
trend), compared to the observed QLF break (colored points) from various 
studies. The mass A/50 is converted to a luminosity assuming that the charac- 
teristic luminosity L» of quasars traces their host masses, since these objects 
are all observed to be near-Eddington and the black hole-host mass relation 
evolves weakly with redshift to z < 1 (alternatively, convolving the mass dis- 
tribution being "added" to early-type populations in Figuref4]with some prob- 
ability of seeing the black hole in each system at a given luminosity extend- 
ing to ~ Lrjdd yields a similar L*). The QLF measurements shown from hard 
X-ray (red), soft X-ray (orange), and optical (green) are converted to bolo- 
metric lumin osities with the observationally de termined bolometric and dust 
corrections in Hopkins, Richards, & Hernquist (2007). The bolometric break 
luminosity directly fitted in Hopkins, Richards, & Hernquist 1 2007) from the 
compilation of the samples shown and ~ 30 other measured quasar lumi- 
nosity functions (see references therein) is also shown (blue). The observed 
break luminosity from all samples is consistent with the expectation of any 
model in which the objects in Figuref4](objects being added or moving to the 
early-type population) are associated with quasar "triggers," with the black 
holes appropriate for their stellar mass Mfo(z) briefly accreting near Edding- 
ton. 

QLF agrees well (x 2 /v 1) with that observed at every red- 
shift at which this comparison is possible, and the inverse 
(predicted merger MF from the QLF) also agrees well with the 
observations. Since we have shown that the buildup of the el- 
liptical MF and M50 trace a similar mass distribution to merg- 
ers, it is expected that the resulting L* will agree with t he QLF . 
However, our comparison here, unlike Hopki ns et al.l (|2006d), 
is model-independent, based only on the well-determined 
black hole-host mass relation and Eddington limit. 

6.2. The Transition Mass from the QLF Break 

In Figure [12] we invert this comparison, and estimate the 
characteristic "transition" mass expected based on the ob- 
served QLF. We first show points as in Figure QT| estimat- 
ing a characteristic host mass from the QLF (inverting 
Equation |6j. However, a proper calculation is not so triv- 
ial, as in detail M50 will be determined by the integrated 
buildup of the early-type MF (and late-type MF for M, r , M Q ), 
and therefore requires that we adopt some model for quasar 
light curves and triggering. Hopkins et al. (2006a,b) use 
a large set of several h undred hydrodynamical simulations 
dRobertson et al. 2005a b) of galaxy mergers, varying the rel- 
evant physics, galaxy properties, orbits, and system masses, 
to quantify the quasar lifetime (and related statistics) as a 
function of the quasar luminosity. They define the quantity 
?q(L|Mbh), i.e. the time a quasar of a given BH mass (equiv- 
alently, peak quasar luminosity ?q(L | L pea k)) will be observed 
at a given luminosity L. They further demonstrate that this 
quantity is robust across the wide range of varied physics and 
merger properties; for example, to the extent that the final BH 
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mass is the same, any "major" merger of sufficient mass ratio 
(less than ~ 5 : 1) will produce an identical effect. Given the 
tight black hole-host mass relation (Mbh-M*), it is trivial to 
write this as t Q (L\M*). Since at all L, t Q <C t H (the Hubble 
time), the observed QLF 4>q{L) is given by 

<t> Q (L,z) = [ t Q (L\M ^f}f*' f dlogM., (7) 
J at dlogM* 

where d$(M*)/df dlogM, (hereafter = <^>(M*,z)) is the rate 
of quasar "triggering" as a function of host spheroid mass at a 
given redshift. If the "trigger" is associated with formation of 
the spheroid or movement of a galaxy to the RS, then 0(M* , z) 
is the rate of buildup of the RS as a function of mass, directly 
comparable to that in Figure |4] 

Knowin g 4>o(L) direc tl y from observations and adopt- 
ing the Hop kins et all (l2QQ6allbb t Q (L\M*) (which is 
at least consistent with all quasar observational con- 
straints), the inversion of Equation Q yields cf>(M*,z). 
Hopkins, Richards, & Hernquist (2007) perform this inver- 
sion, using their large compilation of observed 4>q{L), and 
quote the best-fit </>(M»,z) (Table 5). We adopt their best- 
fit model for each redshift interval (individually; although a 
global fit yields similar results). If, again, this represents the 
buildup of the RS, then we can integrate from z — > oo to obtain 
the RS MF at all redshifts and calculate M 50 (z). The late-type 
MF is reasonably well-measured over the range of interest, so 
comparing it with this integration also yields an expected M tr 
and Mq. We compare these estimates with the observed M50 
in Figure[l2] and find they agree at all observed redshifts. 

Having obtained the rate of "buildup" of early type MFs ex- 
pected if each quasar "trigger" is associated with the forma- 
tion/movement of a RS galaxy, we can directly compare with 
the rate of buildup implied by observed early-type MFs. Fig- 
ure [13] plots the time-averaged buildup determined from the 
QLF, from the same cf>(M*,z) as Figure [12] compared with 
the observed buildup from Figure |U We consider both the 
mean time-averaged buildup (averaged over each appropriate 
redshift interval) assuming each quasar "trigger" is instanta- 
neously associated with the movement of a galaxy to the RS, 
and that expected if there is a uniform ~ 1 Gyr delay after each 
quasar before the galaxy becomes red (allowing time for e.g. 
gas exhaustion and reddening). In either case, this estimate 
agrees with the observed "buildup" of elliptical populations, 
at all masses and redshifts (x 2 / v ~ 1 at all z > 0.3). The latter 
(1 Gyr delay) case gives marginally better agreement, but the 
difference between the two is comparable to the uncertainties 
in either determination of <^>(M*,z) (see Figure PT21). At the 
highest masses at low redshifts (z < 0.3; although also to a 
lesser extent at z ~ 0.3-0.7), this estimate falls short of ob- 
served rates of "buildup." However, this is precisely where 
we have estimated that observed rates of "dry" mergers can 
account for early-type growth. Since gas-free mergers are not 
expected to trigger quasar activity, it is not surprising that this 
would not be implicit in quasar luminosity functions. Allow- 
ing for the contribution of dry mergers shown at z < 0.3 im- 
proves the agreement considerably (x 2 /v ~ 2). There is still 
some tension matching the observations near 10 n M Q , but it 
is important to note that at these redshifts, the cosmic variance 
associated with small volume, narrow-field galaxy surveys 
and, perhaps more importantly, with even wide-field quasar 
surveys (given the very low local space density of quasars), is 
largest. 

Having estimated the rate of quasar "triggers," <j>(M* , z), 



in Figure [T3j then if each such "trigger" is in fact a galaxy 
merger, we can convert this to an expected merger fraction in 
exactly the same manner as we converted the rate of early- 
type "buildup" in Figure [4] (i.e. simply assuming an observ- 
able merger timescale t mager ). This is shown in Figure|9] along 
with the observed additional contribution from dry mergers. 
Given the agreement with the rate of elliptical "buildup" in 
FigureQj] it is not surprising to find this agrees with observed 
merger fractions. 

As a caution, we should note that these calculations can give 
a misleading result if the full luminosi ty dependence of th e 
quasar "lifetime" from simula tions ([Ho pkins et al. 2 005allbL 
2006b ) and observat ions (e.g.. lAdelberger & Steidell 12005 bt: 
IVolonteri et alj2006l) is not properly taken into account. Such 
a case is not, of course, well-motivated physically, although it 
may represent alternative quasar feedback models (or a com- 
plete lack of such feedback), but it is nevertheless sometimes 
adopted for simplicity. Why should such a simplified model 
give a qualitatively different result? In the ?q(L|Mbh) model 
we consider, fg is larger at low luminosities, because low-level 
AGN activity can persist for a long time after the violent, sud- 
den high-accretion rate episode in a merger. Ignoring this lu- 
minosity dependence and assuming, e.g. that all quasars turn 
on and off (as "light-bulbs") for a short time implies that all 
observed quasars, even those at very low luminosity, are seen 
at (or very near) their "trigger," i.e. are in ongoing mergers. 
This gives a misleading estimate of the number of mergers 
needed to account for the QLF, and as a result yields an in- 
correct estimate of h ost luminosity functi ons and black hole 
mass functions (e.g jHopkins et aU 2006b d). as well as, con- 
sequently, erroneous estimates of the associated "transition" 
mass. 

6.3. Further Tests of This Association 

Having considered the Hopkins et al. ( 2006b) models of 
merger-triggered quasar lightcurves, we briefly note addi- 
tional future tests of these models and the generic associa- 
tion between the blue-red transition or elliptical formation and 
quasar activity. In Figure [14] we compare the observed QLF 
with the expected "conditional" QLF, i.e. the contribution to 
the QLF from hosts/merger remnants with different masses 
relative to the observed "transition" mass. In other words, the 
contribution 

A0(L)=f fi (L|M*)0(M*,z)AlogM* (8) 

from Equation (0. The QLF near corresponds to ob- 
jects with M* ~ M50. At the faintest and brightest lu- 
minosities, there are contributions from smaller and larger 
hosts, respectively (and a significant fraction of objects at the 
lowest luminosities w i ll not necessarily be merger-triggered; 
iHopkins & Hernquistl (l2006l) \ but it is clear from the figure 
that a direct measurement of the host masses of characteristic 
quasars at z should find their hosts dominated by objects with 
Mhost ~ M„- or M50, many of which should appear as relatively 
young ellipticals, if this picture is correct. 

In this scenario, one might also expect that objects around 
the "transition" mass preferentially host AGN. Since Equa- 
tion ^} implicitly defines the probability to see a galaxy with 
mass M* (BH mass Mbh ~ /iM*) at luminosity L, we can es- 
timate the fraction of such galaxies with a given Eddington 
ratio. Figure[l4]plots the fraction of galaxies hosting an AGN 
with an accretion rate (relative to Eddington) m > 0.1 as a 
function of galaxy stellar mass at several redshifts, compared 
to the observed "transition" mass at that z. (Note the actual 
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FIG. 12. — Observed "transition mass" (black points show M50 from Figure [3] from color-selected samples only, for clarity; dotted black line shows 
best-fit trend) compared with that pr edicted from qu asar luminosity functions. Blue stars show the observed QLF characteristic luminosity L„ from 
Hopkins, Richards, & Hernquist 12007) (see Figure [Til other samples not shown for clarity but trace a similar trend) directly converted to a characteristic 
"associated" mass given the observed Eddington ratios and black hole-host mass relation at these redshifts. Solid lines show the prediction of the simple as- 
sumption that the initial trigger of each bright quasar is associated with the form ation/addition of an early type galaxy. In detail, the compilation of quasar 
luminosity function data from z = - 6 in Hopkins, Richards, & Hernquist 1 2007, and references therein) is converted therein to a mass-dependent "rate" of 
early-type formation/additon, given this assumption and the model Eddington ratio distributions (light curves) from Hopkins et al. (2005a-e, 2006a-d). Thin lines 
use different model fits of this "rate"' to the QLF dat a (and give an approximate idea of the uncertainties in this empirical modeling) to calculate M50, with the 
thick line adopting the best-fit to the quasar data (see Hopkins, Richards, & Hernquist 2007, Table 5). Open circles and dashed line calculate M, r (adopting the 
observed late-type galaxy mass functions from Bell et al. (2003, z=0.05), Borch et al. (2006, z=0. 2-1.0), and Fontana et al. 12004, z=1.0-1.7)) instead of M50, 
with the same method. Red dotted line calculates M50 in this manner, but instead adopts an unph ysic al "light bulb" quasar light curve model (alternatively, this 
assumes that all observed low-luminosity quasars are in ongoing mergers/quenching). As in FigurefTTJ the QLF break appears to trace the same parent population 
mass and evolution with redshift as M tr IM^Q, consistent with quasar triggering and the buildup of black hole mass being associated with the buildup of spheroid 
populations. 

AGN luminosity will be mLEdd(A^BH)-) This particular pre- 
diction is sensitive to the different fits to the "triggering" rate 
4>(M* , z) provided in Hopkins, Ric hards. & Hernquistl d2007l) . 
especially at low mass (M <C M50), but the trend that the 
peak/turnover in this distribution tracks the "transition" mass 
is robust. 

7. THE TRANSITION MASS AND THE HALO QUENCHING MASS 

In most semi-analytic models, gas infalling in dark matter 
halos is shock-heated to the virial temperature, and, in low 
mass halos, subsequently cools on a short timescale, allow- 
ing rapid accretion onto the central halo galaxy and defining a 
"rapid cooling" or "cold accretion" regime. However, in mas- 
sive halos, the cooling time is longer and gas forms a quasi- 
static "hot" halo, d efining a "static hot ha l o" or "hot accre- 
tion" regime (e.g., iRees & Ostrikerl Il977t iBlumenthal et alj 
1984). More recently, it has been suggested that the transi- 
tion between these regimes is sharp, near a halo mass M cr j t ~ 
10 12 M Q (although this number is uncertain by a factor of 
sever al) at low redshift ( Birn boim & Dek el 2003; Kere s et all 
2005), and that suppression of future cooling and accretion is 
very efficient, es sentially "cutting off" all gas supplies above 
this mass (e.g., iDekel & Birnb oim 2006) and "quenching" 



star formation. 

The shock-heating of infalling gas need not be the specific 
physical agent of this "quenching": for example "radio mode" 
or low-luminosity, contin uous AGN feedback dCroton et alj 
2006; Cattaneo et alj 120061) or cyclic, short-lived quasar ac- 
tivity dBinnevll2004h may be invoked to maintain the gas in 
the "hot" phase. There is therefore a potentially important 
distinction between semi-analytic models (SAMs) which as- 
sume that the feedback mechanism is "at ready," such that 
upon crossing the critical mass M alt , star formation and gas 
accretion onto the central galaxy is instantaneously termi- 
nated, and those that require some additional mechanism or 
process (such as the formation of a relatively massive bulge 
and black hole) to drive the blue-red transition and transfor- 
mation/movement of galaxies to the RS. 

This essentially relates to the important distinction, dis- 
cussed in § [TJ between the mechanism by which galaxies be- 
come red/elliptical and that by which they maintain their col- 
ors/low star formation rates. The key value of invoking this 
"hot accretion" regime in SAMs has been the ability to sup- 
press star formation on timescales of order the Hubble time. 
However, although this could, in principle, be necessary to 
yield red galaxies at z = 0, it does not automatically follow 
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FIG. 13. — The time-averaged rate of buildup of early-type populations, as in Figure[4] compared to that implied by the quasar luminosity function (QLF) if 
every quasar "trigger" is associated with the movement/formation/transition to the red seque nce of a corresponding (M* ~ 10 3 Mbh ) spheroid (red lines). The 
functional form for the QLF and implied "triggering rate" is taken from the best-fit given in Hopkins, Richards, & Hernquist 12007). Lower line in each panel 
assumes instantaneous reddening, upper line a 1 Gyr delay (post-quasar peak) before objects appear on the red sequence. The implied rate, if quasars and the 
blue-red transition are associated with the same event, agrees well with the buildup of elliptical populations at all masses at moderate and high redshifts. At 
low redshift z < 0.3 (and to a lesser extent, at z ~ 0.3-0.7), the implied rate from the QLF falls below the observed buildup at high masses. The deficit can be 
accounted for at the highest masses with the observed rate of "dry" mergers (which, by definition, will not generally trigger quasar activity). 
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FIG. 14. — Left: Predicted contribution to the total B-band QLF (dotted, from the compilation of Hopkins, Richards, & Hernauist (2007)) from host 
galaxies in different mass intervals rel ative to the observed transition mass as labeled, from the models of Hopkins et al. (2005a-e, 2006a-d; see Table 5 of 
Hopkins, Richards, & Hernquist ( 2007)) (shown at z = 0.5, but qualitatively similar at all redshifts of interest). Right: Predicted fraction of galaxies hosting an 
AGN with an accretion rate (relative t o Eddington) m > 0.1, as a function of mass, at different redshifts as labeled. Solid lines show the best-fit model from 
Hopkins, Richards, & Hernquist 1 2007, Table 5), dotted lines the 2a minimum allowed contribution to the QLF from low-M* hosts. Points of the corresponding 
color show the observed "transition" mass at each redshift (from the best-fit trend; dashed line in Figure[2] error bars show approximate dispersion from different 
"transition" mass definitions). Although systematic uncertainties at low masses are large, the predictions above - that quasar ~ L* hosts and a turnover or peak 
in the "active" galaxy fraction should correspond to ~ M tr hosts - are robust expectations of models which associate a blue-red transition and quasar activity. 
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that it is sufficient. In other words, there may be other pro- 
cesses (e.g. mergers and/or quasars) which drive the blue-red 
transition and movement to the RS, and the "hot accretion" 
mode simply maintains these galaxies at their low star forma- 
tion rates. 

One possible interpretation of the observed "transition 
mass," perhaps the most naive, is that the transition mass sim- 
ply represents the stellar mass hosted in M cl -i t halos at each 
reds hift. If we adopt the expec ted halo quenching mass M cl j t 
from lDekel & Birnboiml d2006l) . and either assume the galax- 
ies hosted have the same stellar mass as those in z = h alos 
of the same mass (measured in Mandelbaum et al. 2006|), or 
that they are already fully assembled (i.e. have stellar masses 
at z appropriate for what their halo mass will be at z = 0), we 
can compare with our observed "transition mass." We find 
that while the two are similar at low redshifts, they diverge at 
higher-z. This is, of course, where the observations are most 
uncertain, so it may simple reflect a systematic error in our es- 
timation of the "transition mass." But it probably also reflects 
the possibility that, in these models, the "transition mass" has 
a more complex physical origin than simply tracing M cl -i t . As 
noted in § |3] allowing for more complex and realistic distri- 
butions of galaxies in transition to the RS can affect quanti- 
ties such as the "break" mass M* and "transition" mass in a 
non-trivial manner. But there is also the possibility that the 
"transition" to the RS requires additional processes beyond 
the initial cutoff of new gas supplies in "hot mode" accretion, 
such as gas exhaustion, mergers, and/or quasar activity to op- 
erate, which are what we see traced by the observed transition 
mass. 

It is also worth considering whether or not the flow of 
galaxy host halos across M cr i t is consistent with the number 
and mass densities of "transition" objects and the buildup in 
early-type populations which we have estimted from the ob- 
servations. The rate at which halos cross a given mass thresh- 
old M is straightforward to calculate in linear theory from the 
Press-Schechter formalism, 



F(>M|z) = erfc 



/ ^coii(z) 
V\72o-(M) 



(9) 



and adopting either a simple mean stellar-to-dark matte r mass 
ratio (from the calibration of Mandelba um et al.1 l2006). or in- 
tegrating (at this halo mass) over the population of inferred 
hosted galaxies from the conditional galaxy mass functions 
(CMFs; i.e. probability that halos of mass M host galaxies 
of stellar mass M») yields an estimate of the rate at which 
stellar mass crosses this threshold. The local CMF is deter- 
mined (albeit indirectly) entirely from observatio ns of galaxy 
mass/luminosity functions and clustering (e.g., lYang et all 
120031 120051: IZheng et al.1 120051) . and has been s ubsequently 
measu red directly in 2dFGRS group catalogues by lYang et al.1 
(120051) . and is well-constrained with typical uncertainties 
smaller than or comparable to those in our estimate of the rate 
of early-type "buildup" (at least forM* > 10 I0 M Q of interest 
here; the MF at lower masses depe nds on the mass t hreshold 
for inclusion of satellite systems). lYan et all d2003l) : ICooravl 
d200l [2006b extend the CLF/CMF to high redshifts (z < 4) 
using a large number of luminosity function and cluster i ng es- 
timate s fr om wide-area s urveys. Note that I Yang etail (I2003L 
2005) and lCooravl 12006) actually measure the conditional lu- 
minosity function (CLF); we convert to a CMF using the ap- 
propri ate M/L ratios as a function of mass from Be ll et al.1 
(2003). and assume these M/L values evolve with redshift fol- 
lowing the best-fit stellar population models as a function of 



mass from lGallazzi et all d2006l) and lRenzinild2006l). Check- 
ing directly (following the meth odology o f Yang et al] |2003l) 
shows that this agrees with the lBell etal]d2003h mass func- 
tions (see also their Figure 19) and the IlI et aU<[2006) mea- 
sured clustering as a function of stellar mass, and further- 
m ore, these CMFs ag ree well with those directly determined 
in lZheng et alj (2005). Ultimately, there are a number of sys- 
tematic (factor ~ 2-3) uncertainties in this comparison, and 
our (admittedly crude) empirical calculation ignores the fact 
that, in "quenching" models, crossing the quenching thresh- 
old itself may change the stellar-to-dark matter mass ratios 
and stellar M/L values. However, within these rather large 
uncertainties, our purely empirically estimated rate at which 
galactic host halos cross M cl ; t is consistent with the possibility 
that this is a necessary prerequisite for "transition" to the RS. 

8. CLUSTERING: AN INDEPENDENT TEST 

We compare the populations we have considered in an inde- 
pendent manner by examining their clustering properties. If a 
population (i.e. a given set of "parent" halos) clusters with a 
given bias b(z) at some redshift z, then the subsequent evolu- 
tion in their bias is trivially calculated in linear theory 

b{z = Q) = l+D(z)[b(z)-l], (10) 

where D(z) is the growth factor dCroom et alJl200 lh . regard- 
less of the processes (accretion, mergers, etc.) that affect the 
halos (and galaxies) themselves. The bias of galaxies (specif- 
ically red/elliptical galaxies) as a function of stellar mass is 
well-determined a t z = (we ado pt the recent determination 
from the SDSS in lLi et alj|2006l with typical < 10% uncer- 
tainty), so given the bias of a population at z and evolving 
it to z = with Equation (fTOb yields the characteristic z = 
stellar mass of this population (i.e. the average stellar mass of 
which the population is the "parent"). 

Figure[l6]shows this M», calculated from various clustering 
measurements b{z) of quasars, mergers (ULIRGs and SMGs), 
and E+A galaxies as a function of redshift, and compares to 
the "transition" mass Mso at each redshift. We invert this as 
well; knowing Mso(z), evolve b(M 5ai z = 0) with Equation ( fTOb 
to estimate b(z). Note that all b(z) shown from measurements 
are converted from the directly observed clustering length ro, 
which for a power law correlation function yields 



r = r (z = Q)[bD(z)] 



7/2 



(ID 



with rp(z = 0) ~ 5/z _I Mpc and 7 ~ 1.8 (see, e.g., 
iNorberg eT al. 2002). The absolute value of the bias as plotted 
is then weakly dependent on cosmology (and this conversion, 
of course, is inexact), but the important point for our purposes 
is that the relative bias of all points plotted (and b(M* ,z = 0) 
with which we compare) is insensitive to the cosmology. 

These comparisons do assume that the stellar mass of indi- 
vidual systems does not change much from z to z = 0, i.e. that, 
once formed, ellipticals are passively evolving. However, 
we can easily eliminate this assumption, by considering the 
clustering directly observed for red galaxies of mass Mso(z) 
at that redshift, and Figure [16] shows this as well. In either 
case, the agreement with the clustering of quasars and (albeit 
much less well-constrained) merger/E+A populations is good. 
This also agrees with determinations of e.g. the typical over- 
densities and small-scale clustering of quasars and ULIRG s 
dFarrah et al.ll2004t iHennawi et alJl2006t ISerber et al.ll2006h . 

This method by which we compare clustering is only 
weakly dependent on cosmology, through the growth fac- 
tor D(z) (independent of e.g. a%). There are some caveats, 
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FIG. 15. — Left: The "transition" mass determined by various definitions (M50 from all samples, M tr from color-selected samples, and Mq from color-selected 
samples shown as black, purple, and orange points, respectively, as in Figure 0, as a function of redshift. Dotted lines show the characteristic mass M* of all, 
red, and blue galaxies (green, red, and blue, respectively; see Figure|5J. The null hypothesis that "transition" mass objects (as well as quasars and mergers; see 
right panel) are drawn randomly/uniformly from the all, red, or blue galaxy population can be ruled out at > 6<r (> 5<x for quasars, > 3<r for mergers). Right: 
The "transition" mass, as in the left panels, together with the characteristic masses M„ of merger mass functions (red points; as in right panel of Figure|7J and 
quasar hosts (blue stars; as in Figure TT2l . The observations at all redshift are consistent with the hypothesis that mergers, quasars, and the transition/addition to 
the red sequence are associated with the same event. 
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FIG. 16. — Left: Observed "transition" mass (black points show M50 as in Figure[2] from all samples; black dotted line shows best-fit trend) as a function of 
redshift, compared with the characteristic host masses (colored points) of quasars, recently-formed elliptical (E+A/K+A) galaxies, and ongoing bright mergers 
(ULIRGs/SMGs), estimated from their clustering properties. Right: Corresponding bias as a function of redshift. Black points as in left panel show b(z) calculated 
from Mso(z) and the observed z = bias ILi et all 2006) for that stellar mass (i.e. assuming passive evolution), open black s quares show b(z) determined directly 
from observations for red galaxies with the appropriate M ^(z) (Mjo(z) from best-fit trend; points of increasing redshift from Li et al. 2006; Shepher deTamOOll; 
Polio et al. 2006; Meneux et al. 2006; Brown et al. 2005, respectively). Colored points show b(z) observed for the quasar, E+A, and merger populations, as 
labeled. Note that b(z) is defined from the clustering length ro and thus the relative bias and M t shown are only weakly cosmology-dependent. This provides a 
completely independent check of the previous comparison between these populations, but one which suggests a similar co-evolution. 
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however. Technically, we are estimating the mass which 
has exactly the observed bias; this is some weighted mean 
mass. However, theoretical expecta tions from physic ally mo- 
tivated quasar light curve models dLidz et al.l 120061) and di- 
rect observations of clustering as a function of luminosity 
( Adel berger & Steidell2005bt ICroom et alJ2005t iMyers et all 
12001 " suggest that quasar clustering depends only weakly 
on luminosity, reflecting a reasonably well-defined char- 
acteristic host mass. These comparisons will also, of 
course, be affected if the clustering of mergers is differ- 
ent on large scales from that of non-merging halos of the 
same mass (a so-called "merger bias"). However, a num- 
ber of investigations have found no such dependence (e.g. , 
Lemson & Kauffmanr] 119991; iKauffmann & Haehneltl 120021; 
Percival et al J 120031) and even where more rece nt investiga- 
tions have seen such an effect dGao et a l. 2005) it has been 
restricted to small mass halos (below the "collapse mass", 
i.e. where b = 1) at z = 0, and therefore the assumption of 
no merger b ias has generally been adopted in quasar cluster- 
ing studies (Martini & Weinb ergll200U iHaimari & Huill2001t 
Adelb erger & Steidelll2005bt ICroom et alj|2005t IMyers et ail 
2006; lLidz et alJl2006l) . 

We can repeat t his comparison using the formalism of 
iMo & White! ([1996) from linear collapse theory, which yields 
a characteristic halo mass from a given observed b{z). We 
use the observed stellar mass-halo mass relations calibrated 
for elliptical galax i es fro m weak lensing measurements in 
Mandelbau m et all (2006) to convert these halo masses to a 
stellar mass M, r . We convert between halo mass and bias 
with the method o f IMo & White! (1996) modified following 
ISheth et al.l §001) in our adopted cosmology (in detail as- 
suming og = 0. 8, n s = 0.98) with the p ower spectrum com- 
puted following Eisenstei n & Hul dT999). The results are sim- 
ilar, but are much more sensitive to the adopted cosmology 
and systematics in the stellar mass-halo mass relation in this 
approach. 

In considering the clustering of M50 objects, we have con- 
sidered the directly measured bias of objects with mass Mso(z) 
at redshift z, as well as the "passively evolved" clustering from 
the z = bias as a function of mass. We can gain further in- 
sight into the evolution of these populations by comparing the 
two. Knowing the observed bias of Mso(z) objects at z, we 
can evolve this to z = given Equation d 1 0b . and then use 
b(M*z = 0) to obtain the typical stellar mass hosted by these 
systems at z = 0. Comparing that to their stellar mass at z, 
namely Msq{z), shows by how much the typical stellar mass 
of the population has grown. We could also estimate this in 
a more indirect fashion, using linear theory to estimate a host 
halo mass Mh a io(z) given b(z), then knowing the z = mass 
of a halo with mass M\ rd \ {z) at z, use the local galaxy stellar- 
halo mass calibrations from lMandelbaum et al.l (120061) to ob- 
tain M*(z = 0). Again, this approach is considerably more 
sensitive to the assumed cosmology, but in our adopted case 
yields similar results. 

In Figure [T7] we use this to compare Mso(z), the stellar 
mass of "transition" mass objects at z, with M»(z = 0), i.e. 
the mean z = stellar mass which is typically hosted by the 
evolved "parent halos." Unless "transition" mass objects com- 
prise some unusual outlier in their halo properties, this should 
represent the typical stellar mass these objects will grow to 
by z = 0. We c ompare with the expectation, following e.g. 
iBell et all (l2006al) . that these stellar masses grow at a rate cor- 
responding to one major (mass ratio 1:1) merger since z = 1. 
We also consider the case if the stellar mass in these objects 



grows in fixed proportion with their host dark matter halos. 

The systematic uncertainties (and measurement errors in 
b(z)) are sufficiently large that we should regard these com- 
parisons with caution, and not consider this as evidence for a 
particular "amount" of dry merging. However, the estimated 
M»(z = 0) demonstrates that the M^(z) measurements are 
completely consistent with subsequent growth by dry merg- 
ers at observationally inferred rates. Growth in proportion to 
the host halo mass, by contrast, is extremely difficult to recon- 
cile with observed properties of the galaxies. This is not sur- 
prising, as the existence of any significant ~ 10 12 M Q galaxy 
population at z = 1 without a corresponding ~ 10 13 - 10 14 M Q 
galaxy population at z = implies that, at least for some ob- 
jects assembled most rapidly, subsequent galaxy assembly 
must lag behind halo growth (or subsequent growth in these 
halos must be anomalously slow). This does mean, however, 
that it is not possible to reconcile the observations with a 
model in which galaxy assembly uniformly tracks halo as- 
sembly, even allowing for the final galaxy stellar-halo mass 
ratio to be a function of halo mass (i.e. setting in all progen- 
itors the effective M/L of the z = halo, which then simply 
assembles). 

9. SUMMARY & DISCUSSION 

We compile a large number of observations of red/elliptical 
galaxy mass functions, and use these to determine the rate of 
"buildup" of the red sequence (RS) as a function of mass and 
redshift. Comparing these with observations of other pop- 
ulations allows us to test a number of different models for 
the possible associations between these populations and the 
"transition" of galaxies from blue, star-forming disks to red, 
"dead" ellipticals. 

Independent of the nature of "downsizing" in the buildup 
of RS mass functions (discussed below), the rate of RS 
"buildup" is sufficiently well-determined to place meaning- 
ful constraints on a number of models. Dissipationless (gas 
poor, red, or "dry") mergers can account for the buildup of 
the RS at only the largest masses > 10 n M Q at low redshift 
(z < 0.3). At higher redshifts (z > 0.5), the dry merger rate 
would have to be at least an order of magnitude larger than 
observationally estima te d (Ivan Dok kum 2005; Masiedi et al. 
l2006tlBell et al.l2006allbl;lLotz et alj2006bl) to account for ob- 
served RS buildup, even at the highest masses. This is perhaps 
unsurprising, as these and other observations find the gas-rich 
merger rate/fraction is an order of magnitude or more larger 
at all but the lowest redshifts. Furthermore, the total mass 
density on the R S is observed to increase by a factor ~ 2.5 - 3 
since z ~ 1 (e.gjBundv et all2 005a: Frances chini et alJ2006l : 
Pann ella et al.ll2006l;lBorch et al.l l2006), and dry mergers can- 
not, by definition, move/form "new" galaxies and mass on the 
RS. 

However, we find the total observed merger population (gas 
rich+gas poor) agrees very well with that expected if all RS 
galaxies are formed in mergers. Both the detailed mass distri- 
bution and fraction/rates of galaxy mergers are consistent with 
the rate of RS buildup at all masses and redshifts observed. 
This merger population is dominated by gas-rich mergers 
at all masses at high redshifts (z > 0.5 IBell et all [2006b; 
lLotz et alJ2006bh and at low masses at low redshifts, morpho- 
logically identifiable as bright (i.e. star-forming or starburst- 
ing) i nteracting systems (e.g.. iBundy et alJl2006t IWolf et alJ 
120051) . In detail, completely neglecting dry mergers (or 
merger mass functions sensitive to them), this agreement is 
unchanged except for the highest masses at low-z discussed 
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FIG. 17. — The best-fit "transition" stellar mass Mso(z) ( see Figureff] dotted lines with blue shaded range show ~ lc range) at a given redshift, compared to 
the inferred stellar mass of the same objects at redshift z = (red shaded range). The observed bias of a fixed set of objects (halos) with stellar mass M^q(z), 
b(Mso(z), z), evolves simply to z = (Croom et al. 2001), where a comparison with the local b(M±, z = 0) yields the typical stellar mass h osted by such halos . 
Dashed lines s how the result of taking M^o(z) (dotted lines) and allowing for a constant rate (~ 1 since z = 1, as suggested by observations; van Dokkum 2005; 
Bell et al. 2006a) of major (equal-mass) "dry" mergers. Dash-dotted line assumes that subsequent (aft er z) galaxy assembly tracks halo assemb ly (i.e. efficient 
dry merging; no "downsizing" in assembly times), as implied by some semi-analytic models (e.g., De Lucia & Blaizot 2006; Cattaneo et al. 2006, note this is not 
a rigorous comparison with these models). Although systematic uncertainties are sufficiently large that this should not be considered evidence for a particular 
amount of dry merging, the comparison demonstrates that the evolution in Mso(z) is completely consistent with observational evidence for a significant growth 
by dry mergers since z > 1 . A very large number of dry mergers is, however, inconsistent, and this also violates the direct observationally determined rates of dry 
mergers above (galaxies grow by a factor > 2 since z, = 1). Note that dry mergers do not, by definition, "build up" the total mass budget on the red sequence, and 
at observed rates, have a relatively weak effect on the RS MF near the "transition" mass (see also Figuref4). 



above. There is substantial systematic uncertainty in convert- 
ing a merger fraction to a merger rate; our comparisons as- 
sume a characteristic observable merger timescale ~ 0.5 Gyr. 
However, this is a theoretically reasonable timescale (see §|4), 
and given the scatter in the observations, our conclusions are 
not changed for systematic shifts within a factor < 2, nor for 
allowing the merger timescale to scale with halo dynamical 
times (oc (1 +z)~ 3//2 ). Furthermore, this has no effect our com- 
parison of the mass distributions of these populations. 

Similarly, we find the rate at which host galaxies trigger 
quasars, determined as a function of the host stellar mass and 
redshift from the quasar luminosity function, agrees well with 
the observed RS buildup at all masses and redshifts observed. 
There is some discrepancy at the lowest redshifts and high- 
est masses, but this is again where the dry merger contribu- 
tion can account for the observed buildup, and dry mergers 
(by definition being gas-poor or gas-free) are not expected to 
trigger quasar activity. We consider this comparison first in 
a purely empirical fashion, using observed quasar Eddington 
ratios and the black hole-host mass relation to estimate quasar 
host masses as a function of redshift, and then in greater de- 
tail adopting the models of quasar light curves and lifetimes 
as a function of luminosity and host properties from the sim- 
ulations of merger-induced quasar activity in iHopkins et al.l 
(2006b). The latter introduces some model dependence (al- 



though it is consistent with the Eddington ratio and black 
hole-host mass relation estimates), but allows us to consider 
this comparison in greater detail and to make specific predic- 
tions for the characteristic host masses of quasars as a function 
of their position on the QLF and for the AGN or "active" frac- 
tion of galaxies as a function of stellar mass. In either case, the 
agreement between the rates of quasar formation/triggering as 
a function of host stellar mass and the buildup of RS galaxies 
is similar. 

We independently test these possible associations by com- 
paring clustering measurements of the relevant populations as 
a function of redshift, and find similar results. The clustering 
of quasars and systems "in transition" to the RS agree at all 
redshifts as if they trace the same mass distribution. Cluster- 
ing properties of merger (ULIRG and SMG) and post-merger 
(E+A) populations are consistent, but considerably less well- 
constrained. 

Although the above comparisons do not technically depend 
on it, we determine the "transition" mass (M, r , Mq), i.e. the 
mass which separates the blue, star-forming disk and red, non 
star-forming elliptical popu lations, as a funct ion of redshift. 
It has been suggested (e.g., iBundv et al.ll2006l) that this rep- 
resents the characteristic mass at which galaxies are form- 
ing on or being added to the RS as a function of redshift, 
but quantified in this manner, it is not obviously so (see, e.g. 
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IShankar et alj|2006|). We therefore also determine (M50, fol- 
lowing Cimatti et al. 2006) the minimum mass above which 
the RS mass function is > 50% assembled at a given red- 
shift. Regardless of definition, and furthermore regardless of 
the criterion used to separate early and late-type populations 
(whether e.g. a color, star formation rate, or morphology cri- 
terion), M tr IMQlMsd shift to systematically larger masses at 
higher redshift (significant at > 6er), tracing a very similar 
trend as a function of redshift. 

This trend, especially in M50 (which is independent of 
possible evolution in late-type mass functions) suggests that 
"downsizing" applies not just to galaxy star formation, but 
also in some sens e to galaxy assembly, as s uggested by 
the stu d ies of e.g. iBundv et al.l (l2005al 120061): IZucca et al.l 
d2005l): lYamada et al.| d2005l): iFranceschini et all d2006l): 
Cimat ti et al.l (12006b : iFontana et al.l ( 12006b : iBrownetall 
(120061) . In greater detail, considering the full rate of 
RS buildup as a function of stellar mass and redshift, 
low mass (< lO n M0) galaxies appear to be building up 
rapidly /continuously at low redshifts (~ 7- 15% per Gyr), but 
the most massive systems do not (~ 1% per Gyr growth at 
Z ~ 0). The growth of the most massive systems instead ap- 
pears to be rapid at significantly higher redshifts (e.g. rising to 
~ 20-50% per Gyr by z ~ 1). Equivalently, the characteris- 
tic mass (Schechter function M„) defined by this "formation" 
rate appears (albeit at only ~ 2- 3 <r) to increase with redshift 
in a similar fashion to the "transition" mass. 

We compare the "transition" mass with the characteristic 
masses of mergers and quasars and again find they trace sim- 
ilar masses as a function of redshift, with "downsizing" ev- 
ident in all three populations (> 3a for mergers, > 6<r for 
quasars), further supported by their observed clustering. We 
compare with the characteristic (Schechter function) M„ of 
the entire, red, and blue galaxy populations, and rule out at 
high significance the possibility that "transition" mass objects 
are drawn uniformly from any of these populations. Even with 
the systematic uncertainties in this mass estimate, it is also 
clearly distinct as a function of redshift from the characteris- 
tic masses of e.g. cluster, radio galaxy, ERO, DRG , or LBG 
populations (see e.g. Figure 2 of lFarrah et all2006l) . 

These observations are all consistent with and suggest a 
scenario in which major mergers, quasars, and the transition 
from blue disk to red elliptical galaxies are associated. They 
do not inform us regarding, for example, whether gas exhaus- 
tion or stellar or quasar feedback is the specific mechanism 
for the reddening which accompanies the merger-driven mor- 
phological transformation and quasar episode. However, they 
support the hypothesis that mergers drive the transition from 
blue disks to red elliptical galaxies, terminating in decaying, 
feedback-driven bright quasar phases. The transition mass 
and break in the QLF appear to reflect the characteristic mass 
of gas-rich objects merging at a given redshift, which may 
build up the new mass on the red sequence at progressively 
lower masses at lower z as gas supplies are exhausted in more 
massive systems. 

That quasar host masses trace the "transition" mass and 
not, e.g. the blue galaxy population M* (see also Figure [15) 
rules out the possibility that quasar activ ity gene rica lly traces 
star formation, as variants of e.g. the [Granato e t al.l (l2004t) 
models might predict. Likewise, that quasar masses do not 
trace the red galaxy M*, as they would if quasar activity was 
long-lived or randomly (but uniformly as a function of mass) 
episodic in high-mass black holes. The former case would 
be expected from the low-level AGN activity invoked in e.g. 



ICroton et al.l d2006l) . if "radio" and optical or high-Eddington 
ratio quasar activity were associate d, but they are in fact gen- 
erally believed to be d istinct (e.g.. lHol2002l:IWhite et alj2006l: 
iKoerding et al.ll2006h . The lat ter case implie s strong limits 
on implementations of e.g. the iBinneyl d2004l) model, which 
seek to suppress cooling flows through sporadic but poten- 
tially high accretion rate AGN activity. 

Although we can rule out some alternatives to the merger 
scenario, there remain a number of viable variants of "quench- 
ing" models, in which crossing the critical halo mass M cr [ t 
and entering a "hot" accretion regime plays a key role in the 
"transition" to the RS. Especially given that some feedback 
mechanism is typically required, even in the "hot" accretion 
regime, to prevent the formation of cooling flows, it is easy 
to imagine a scenario in which, upon entering this regime, 
new infalling halo gas is shock-heated, but feedback from e.g. 
a central disk galaxy with a small black hole is inefficient, 
cold gas reservoirs remain large, and cooling flows can form. 
Thus the system will not redden until it subsequently under- 
goes a major merger, which morphologically transforms the 
system, rapidly exhausts the remaining cold gas reservoir, and 
triggers a quasar and builds up a massive black hole, inject- 
ing some level of feedback and enabling efficient future (e.g. 
cyclic AGN or radio-mode) feedback. The "hot" accretion 
regime may be a necessary prerequisite for feedback to ef- 
ficiently prevent subsequent cooling, and as discussed in § [7] 
our comparisons are all consistent with this possib ility. This is 
genera lly similar to the scenario assumed in e.g. lCroton et al.l 
(2006), although they do explicitly incorporate quasar light 
curves or feedback. Recognizing these distinctions (as op- 
posed to e.g. assuming a system simply "shuts down" upon 
reaching M or it) will probably have little effect on the z = pre- 
dictions of semi-analytic models, since in either case star for- 
mation will be effectively suppressed at relatively early time s 
in the most massive systems (see also ICattaneo et alT 2006). 
However, at higher redshifts when massive objects are still 
forming, the distinctions will almost certainly be significant. 

We note that none of our conclusions conflict with the hy- 
pothesis that, once formed, elliptical galaxies can continue 
to grow by dry mergers. However, they emphasize that the 
importance of such mergers is restricted to the most massive 
galaxies at low redshifts. Our results, even the steep evolution 
of M50 implying some "downsizing" in red galaxy assembly, 
are all consistent with (and in fact, marginally/avor) the rela- 
tively low observationally inferred dry merger rate (~ 1 major 
dry merger since z ~ 1). Essentially, "downsizing" in galaxy 
assembly as we have quantified it is not, strictly speaking, 
"anti-hierarchical." Massive galaxies still continue to build 
up their populations to the present; it is simply a statement 
that the relative rates of red galaxy formation/assembly de- 
crease or "slow down" at late times in the most massive sys- 
tems. This could be related to pure dark matter processes, 
for example the rapid evolution in large overdensities could 
simply exhaust the "supply" of galaxies with which to merge, 
or the cluster environments of massive systems at low red- 
shift attain sufficient circular velocities as to rapidly reduce 
merger rates (see also Neistein et al. 2006). The evolution of 
the "transition" mass may, alternatively, be a statement that 
galaxy assembly does not strictly trace halo assembly. There 
are a number of baryonic processes which make this possi- 
ble, as it simply requires that the effective baryon conversion 
efficiencies in galaxies be a function of time, or different for 
central vs. satellite systems. 

Improved measurements of early-type mass functions at 



22 



Hopkins et al. 



high redshift (z ~ 1), larger samples of mergers from which 
to construct merger mass functions, revised or direct determi- 
nations of high-redshift conditional mass functions, and direct 
observations of the masses of quasar hosts will substantially 
improve the constraints in this paper. Ultimately, the integra- 
tion of the merger and quasar host mass functions may en- 
able a purely observational comparison with the remnant, red 
galaxy mass function. Calibration of the observable merger 
"timescale" with realistic high resolution galaxy merger sim- 
ulations, i.e. calibration of selection efficiencies for observed 
merger fractions, can further remove the factor ~ 2 uncer- 
tainty in comparing the rates of elliptical buildup and ob- 
served merger populations. The association favored here be- 
tween mergers, quasars, and elliptical buildup also makes spe- 
cific predictions for the characteristic masses of E+A galaxies 
and quasar hosts as a function of redshift, which should be 
testable in future wide-field surveys. 

The scenario we have described does not, of course, imply 
that mergers, quasars, and remnant ellipticals will necessarily 
be recognizable as the same, singular objects at a given in- 
stant - in fact, si mulations which follow the transition through 
these stages (e.g., Hopkins et al. 2006c), predict that they will 
be seen as distinct phases in merger-triggered evolution, and 
observations tracking e.g. the associations between dynamical 



merger state and quasar activity (e.g., IStraughn et al. 1 120051) 
support this distinction. What we ultimately find evidence 
for here in the masses, luminosities, and clustering proper- 
ties of mergers, galaxies being "added" or "in transition" to 
the red sequence, and quasars is that they are drawn from the 
same "parent" population, and that this population is distinct 
from the "quiescent" all/red/blue galaxy population. Again, 
none of this strictly implies causality, but it does favor models 
which associate these populations with the same event, a natu- 
ral expectation if mergers of gas-rich galaxies trigger quasars 
and morphologically transform disks to spheroids, moving 
"new" mass to the early-type population and leaving an el- 
liptical, gas-poor, rapidly reddening remnant galaxy. 
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